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Some Audio Considerations in Air Control Towers 


Joun C. Wesster ann Pavut O. THompson 
Human Factors Division, U. S. Navy Electronics Laboratory, San Diego, California 


Speech intelligibility tests were constructed by including within the framework of standard 
airplane-tower phraseologies standard multiple-choice word lists. In a laboratory room two 
mock control towers were constructed, one having six individual loudspeakers for each of six 
potential message sources, the other having but a single loudspeaker for the same six sources. 
In each mock tower the speaker array could be placed on the ceiling behind the operator’s head, 
on the wall-ceiling angle in front of the operator, or at head height in front of the seated operator. 
Other experimental variables included signal level, noise level, and the density of messages per 
message source. Each tower was manned by a series of actual air-controllers operating in pairs. 
These pairs of operators listened to the same recorded materials at their respective towers and 
responded orally to all messages. The speed and accuracy of their responses were the test criteria. 
The results indicate that: six loudspeakers serve better than one for monitoring six nearly saturated 
message sources; the positioning of the loudspeaker array is not critical; ambient noise levels of 
70 db cause no appreciable decrement in performance, but noise levels of 80 db and greater do. 


INTRODUCTION 


USY CONTROL towers may have from four to six men 
monitoring up to fifteen different radio channels. In 
addition to this, information is passed to and from the tower 
over telephones, squawk boxes, and signal lights. In some 
cases pneumatic tubes are used, and some towers even 
receive visual information by radar. In any situation of 
this type where information is received, relayed, acted upon, 
and transmitted and where the cost of an error is so imme- 
diate and tragic, tensions build up and noises which are 
inevitable become highly annoying and dangerous. 

The masking of the incoming auditory signals is the most 
obvious effect of acoustic noise. The fatigue due to listening 
to marginally intelligible speech above the noise is a sec- 
ondary effect. The noises are of three main types: (1) the 
airborne noise of airplanes warming up, taxiing, taking off, 


and landing; (2) the noise that accompanies all incoming 
radio messages as static and the transmitted cabin noises; 
and (3) the noise of competing messages either incoming or 
outgoing, that is, other voice messages (chatter). 

The present laboratory study was undertaken to deter- 
mine the word intelligibility of a mock communications 
system in the presence of various levels of “chatter” and 
“static.” The number and placement of the loudspeakers 
used to terminate the message sources was studied. Also 
studied was the use of a “pulldown” facility which allowed 
any or all messages to be switched from the regular loud- 
speaker array to a loudspeaker (or earphone) located as 
close as possible to the listener. 


PROCEDURE AND INSTRUMENTATION 


The actual control tower situation with four to six men 
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Fic. 1. Pieture of mock control tower 1. This tower has an in- 
dividual speaker for each potential message source. The speaker 
array may be located in front of the operator (as shown) or on the 
wall-ceiling angle, or above and behind the operator. At the top 
of the console are the 0-40-db attenuators, below these are the 
pulldown switches, and along the bottom are the transmitter se- 
lector switches. (Official photograph, U. 8. Navy.) 


monitoring up to fifteen channels is far too complex to bring 
into the laboratory for a study of the human factors in audi- 
tory perception. In this particular laboratory study, there- 
fore, simplifications were made in the number of messages, 
the number of men listening, and the type of message. Six 
channels were used, four of them carrying test messages and 
two carrying distraction material (chatter). The test mes- 
sages were recorded on magnetic film, and in one case were 
equally distributed among the four channels (0.25 + 0.25 
+ 0.25 + 0.25), and in a second case were unequally dis- 
tributed among the channels (0.70 + 0.20 + 0.05 + 0.05). 
Pairs of operators were used, one operator at each of two 
mock towers. The recorded material was branched so that 
identical messages appeared simultaneously at each of the 
two mock towers. 


A. The Mock Towers 
What is hereafter referred to as mock tower 1 is shown 
in Fig. 1. It consisted of six individual Jensen Speech- 
master loudspeakers mounted in a row with a light on each 


one. These lights were activated by an incoming message 
(sound-powered relays). The switching console consisted 
of an attenuator for each channel, a pulldown switch, and a 
transmitter selector switch. The pulldown switch allowed 
any or all messages to be pulled out of the Speechmaster 
speakers into a WE755 speaker which was located close to 
the operator. The operator was instructed to respond to 
each message on the channel from which it was received. 
He selected this channel by means of the transmitter selector 
switches. The six-speaker array could be mounted in front 
of the operator (as shown), or on the wall-ceiling angle, or 
above and behind the operator. No provision was made for 
increasing the spacing between the speakers. 

Mock tower 2 is shown in Fig. 2. In this case all six 
channels fed into a single 8-in. (755A Western Electric) 
speaker. The indicating lights were mounted on the switch- 
ing console. The attenuators were 6-db pads. The pull- 
down switches allowed the shifting of any or all messages 
from the loudspeaker to the earphone. The transmitter 
selector switches served the same purpose as on tower 1. 


" OVERHEAD—BEHIND 
“p——LOSITION_ 


~CEILING ANGLE ie 
POSITION 


ii FRONT POSITION 


Fic. 2. Picture of mock control tower 2. In this tower all chan- 
nels parallel into a single speaker. This speaker may be located in 
front of the operator (as shown), on the wall-ceiling angle, or 
above and behind the operator. At the top of the console are the 
indicator lights that light up when a message comes in. Below 
these are the attenuator pads (0-6 db), the pulldown switches, and 
the transmitter selector switches. (Official photograph, U.S. Navy.) 
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This loudspeaker could also be placed in front of, above and 
ahead, or above and behind the operator. 

These two mock towers represent the simplifications made 
in the physical audio-termination equipment in order to 
study control tower auditory perception under laboratory 
conditions. 


B. The Test Message Material 


Simplifications were also required in the types of messages 
used. Without the use of motion picture sequences, situa- 
tional conditions could not be easily synthesized. Even if 
some such elaborate procedure had been evolved, control 
tower language is so redundant as to make a speech intelligi- 
bility study highly inefficient if standard messages are used. 

Control tower language has been estimated to be 96% 
redundant.' This being so, only a few words of a message 
need be intelligible in order for the whole message to be un- 
derstood. It follows, therefore, that the remaining words 
add nothing (except a higher degree of certainty) to the 
message and merely waste time in an evaluation. (It is 
recognized, of course, that this redundancy is necessary in 
actual towers since the word intelligibility of their communi- 
cation circuits is very low and incorrectly understood mes- 
sages cannot be tolerated in the interest of safety.) 

In order to get an efficient test for word intelligibility three 
unrelated words were incorporated into standard airplane- 
tower phraseologies. Actual air control tower operators 
listened to these messages and repeated back the key test 
words, again in the standard phraseology. These responses 
were checked for correctness by listeners and were recorded 
on magnetic tape for future rechecking. Figure 3 shows two 
typical incoming messages and the correct responses. 


MILITARY TOWER 
Transmission 
Navy North Island Tower 
This is Navy 461 
System, Foam, Banner 


CIVILIAN TOWER 


Lindberg Tower 
This is United 629 
Arch, Sponsor, Gage 


Over Over 
Response 
Navy 461 United 629 
System, Foam, Banner Arch, Sponsor, Gage 
Over Over 


Fic. 3. Example of test materials and the correct responses. 


C. The Testing 


Enough recorded material was made to allow testing over 
a 6-hour period (one day). The tests were repeated for 
twenty-five days to five pairs of experienced control tower 
operators, each pair listening for five days. 

By properly counterbalancing the sequence in which the 


1F. C. Frick and W. H. Sumby, Control Tower Language, J. 


Acoust. Soc. Amer., 24, 595-596 (1952). 
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Fic. 4. Percentage of words correct for each of the five test 
days, for three pairs of operators. Within each day comparisons 
are made between the operators, the distribution of messages per 
channel, the towers, and the use of the pulldown facility. See the 
text for further details. 


recorded material was administered, the rotation plan of the 
operators between the towers, and the use of the pulldown 
facility, balanced comparison could be made each day be- 
tween the operators, the towers, the message densities per 
channel, and the use or non-use of the pulldown facility. On 
a day-to-day basis the position of the loudspeaker array was 
varied or the noise level was varied. The noise was of two 
types: a thermal noise that simulated vhf static, and the 
actual control tower “chatter” of channel 5. 


RESULTS 


Figure 4 shows the percentage of words correct for each 
of the five test days, for three of the five pairs of operators. 
The solid horizontal line for each operator-pair and for each 
day shows the average percentage of correct words for that 
day. Attending for the moment to these lines only, it is 
apparent that there are pronounced differences among the 
operator-pairs. Operators 1 and 2 grew up with the experi- 
ment and had two weeks of practice before the actual tests 
started. In addition, they listened to the four test channels 
in the absence of either vhf static (TN) or chatter for the 
first three test days. When chatter was added on the fourth 
day, the accuracy of their performance dropped off 6%, and, 
when noise was added to the chatter on the fifth day, their 
accuracy dropped another 4%. The loudspeaker array was 
on the wall-ceiling angle ( 2.) for operators 1 and 2 on their 
first test day. On the second day the array was overhead 
and behind (7), and on the third day the array was in front 
of them (F). 

Operators 3 and 4 had no previous practice, except on the 
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special practice reel provided, before their series of tests. 
Of the sixteen reels constituting test days 1 and 2, eight were 
heard in the presence of thermal noise (at a level of 70 db, 
overall, flat filter position) and eight in the presence of 
chatter. The improvement between days 1 and 2 is quite 
marked, and they might well have reached the level of at- 
tainment of operators 1 and 2 if the task had not been made 
considerably more difficult on test days 3, 4, and 5 by the 
inclusion of both thermal noise and chatter. Notice that 
the speaker array was also moved for them on test days 3, 
4, and 5. 

Operators 7 and 8 listened to only eight reels (plus the 
practice reel) on their first day, four in the presence of TN, 
and four in the presence of chatter. For the remaining days 
they listened in the presence of both TN and chatter. As 
can be seen, they did not show the expected increase (learn- 
ing) between days 1 and 2, and the almost inevitable con- 
clusion is that this anticipated increase was entirely can- 
celled out by the addition of the TN and chatter. 

The speaker array was also moved between days for 
operators 7 and 8. When the results for all three pairs of 
operators are averaged, the overhead position of the speaker 
array turns out to be the best and the angle position the 
worst. Even though the order of speaker-array orientation 
was counterbalanced among the operator-pairs, however, the 
sequence effects are still important. When these sequence 
effects are taken into account, there is no statistically sig- 
nificant difference between the three speaker-array locations. 

Also shown on Fig. 4 are the differences among operators, 
towers, pulldown conditions, and the message density per 
channel. These differences are shown by the deviation of 
the results of operator 1 (or 3 or 7), message density 1 
(0.70-0.20—0.05—0.05), tower 1 (six speakers), and pull- 
down (yes) from the average for the day. It is understood, 
but not plotted, that the results of operator 2, tower 2, etc., 
deviate an equal but opposite amount from the average. 
Any deviations above the horizontal (average) show operator 
1, message density 1, tower 1, and pulldown (yes) to be 
superior to their counterparts. A deviation below the line 
shows the reverse. 

The difference among operators is not particularly per- 
tinent and is appreciable only between operators 7 and 8, 
where operator 7 gets progressively better than operator 8 
as the week progresses. 

Better scores are quite universally obtained from the 
material that had 70% of the messages concentrated in a 
single channel. This is a statistically significant difference. 
The difference is accounted for in part by the fact that, when 
pulldown was used, 70% of the words on the 70—-20—5-—5 
material was heard from the more advantageous pulldown 
position instead of only 25% as on the 25—25-25-25 ma- 
terial. If this were the whole explanation, then each time 
the use of pulldown showed a great advantage on Fig. 4 the 
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Fic. 5. The relationship between the percentage of words correct 
for two pairs of operators and the amount and type of noise. 


advantage of message density 1 (70-20-5—5) would also be 
large; and this is not the case. 

By borrowing some concepts from information theory, 
it can be shown that the equal distribution of messages 
among channels presents the greater amount of information. 
Or, conversely, the unequal distribution is more redundant 
or easier. The simple fact seems to be that the additional 
uncertainty as to just where the next message is coming 
from and therefore which transmitter selector switch to push 
(in the equally distributed case) is enough to cause the 
operator to miss more words. In this case a more complex 
switching and selective listening job increases the difficulty 
of the aural task and increases the number of errors in re- 
peating back the words heard. 

The fifth pair of operators was used to get some additional 
data on the effects of noise and on the advantage of using a 
constant-level limiter amplifier. The particular amplifier 
used operated on an audio AVC principle and was designed 
to take any input from —32 to 0 db VU and put out a 
constant level at any point from 0 to +8 db VU. It didn’t 
quite live up to its design, but it did level signals out over a 
24-db range. The slow attack time of the AVC caused the 
initial speech sound to be much louder than the rest of the 
message, especially if the message was originally at a loud 
level. 

The results of the fifth pair of operators, as well as com- 
parable results for the first pair of operators, are shown in 
Fig. 5. In this case only a single score is taken for each 
tower for each noise condition. For operators 1 and 2 the 
results shown in Fig. 5 for the case of no thermal noise and no 
chatter (condition 1) is the average of test days 1, 2, and 3, 
as shown on Fig. 4. Similarly, the results of noise conditions 
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3 and 4 are their results for the test days 4 and 5, respec- 
tively. The results of operators 9 and 10 show that the 
chatter (3) is more damaging to word intelligibility than the 
thermal noise (2). The results of operators 1 and 2 and 
9 and 10 show that, if chatter is already present (3), the 
addition of 70 db of thermal noise causes no further per- 
formance decrement, at least in tower 1 (six speakers) and 
when the constant-level amplifier was not used. The thermal 
noise and the speech messages were introduced into the room 
through the same loudspeakers. (This simulates the con- 
dition in actual control towers where the main source of 
noise is the vhf static that accompanies the messages.) 
When the constant-level amplifiers were used, the louder 
speech messages were limited and the softer ones were 
amplified. This worked to the advantage of the operators 
when chatter was the only distraction (3), evidently because 
the chatter itself was limited. In the presence of thermal 
noise, however, the constant-level amplifiers did more harm 
than good (2), especially when both chatter and TN were 
used simultaneously (4). This is undoubtedly due to the 


fact that the thermal noise and the speech messages were 
introduced into the room through the same loudspeaker. 
The limiting was done (unrealistically) before the speech 
was mixed with the noise, and therefore the speech-to-noise 
ratio suffered somewhat 


In any case, increasing the level of noise and chatter by 


10 or 15 db caused additional and quite marked decrements 
of performance. In these latter two tests of higher-level 
noise the operator at tower 1 was given complete control of 
his attenuators and pulldown switches and told to do any- 
thing to aid his performance. He finally hit on the idea of 
pulling the chatter into the pulldown speaker and then 
attenuating it 40 db (condition 6 with the constant-level 
amplifier in). This amounts to a “thermal noise only” 
condition and emphasizes again the deleterious effects of 
chatter, since without effective chatter his score was very 
good. 

The general conclusion to be drawn from Fig. 5 is that 
meaningful speech material at a peak level equal to the peak 
level of the speech under test is more deleterious to word 
intelligibility than thermal noise (similar to vhf static) at a 
level of 70 db (overall level, flat filter position). This level 
of 70 db is the minimum level of vhf static background found 
in a number of actual control towers. As the thermal noise 
level increases above this level, with the consequent increase 
in speech level, the word intelligibility drops quite rapidly. 
(In this study the experimenter raised all voice channels by 
an amount equal to the increase in noise. In an actual tower 
the operators would tend to do the same thing, i.e., raise the 
signal strength when the noise increases.) In interpreting 
these results it should be remembered that the thermal 
noise used had a relatively flat spectrum, and that the inter- 
fering effect of noise on speech depends, not on the overall 


level of the noise, but on its level in the frequency bands 
which are most important to speech intelligibility, namely 
the region between 500 and 4,000 cps. If the limiting noise 
in an actual control tower is aircraft noise which normally 
has a spectrum sloping off toward the high frequencies, a 
somewhat higher noise level than 70 db would be tolerable. 


SUMMARY OF RESULTS 


In this laboratory study of the audio factors in air control 
towers it was found that higher word intelligibility resulted 
under the following conditions: 


1. A pulldown loudspeaker (or earphone) was used. 
2. The majority of the messages originated from only a 
few of the possible message sources. 


3. A separate loudspeaker was used on each of the mes- 
sage sources. 


4. The distraction was thermal noise (overall level of 70 
db) instead of control tower chatter. 


It was also found that operators differ considerably in 
their ability to listen to and repeat back the correct test 
words and also that, as long as chatter is present, 70 db of 
thermal noise is acceptable, although 80 db and above causes 
appreciable decrements in performance. 

In an effort to check the validity of certain of these results 
in an actual tower, the following features are being included 
in the new tower at the San Diego Naval Air Station (North 
Island) : 


1. A separate pulldown speaker is being provided for both 
the ground controller and the local controller. 

2. An insert-type earphone pulldown is also provided as 
optional equipment at each position for times of extreme 
noise. 

3. In addition to these two pulldown speakers, three over- 
head speakers will be provided. These five speakers may 
monitor up to fifteen channels. This is permissible, since an 
estimated 90% of the messages come in on no more than two 
channels, and these will ordinarily be pulled down. This 
leaves the three overhead speakers for the remaining 10% 
of the messages. 

4. The flight data position (information from telephones 
and/or squawk boxes) is moved from the front area of the 
tower and faces the rear windows. This increases the front 
area visibility, reduces chatter, and yet provides easy in- 
formation exchange, since by rotating 90° in his chair the 
man at this position can still talk to or hand information to 
either of the men at the other two positions. 

5. Push-to-talk telephones are provided so that ambient 
noises will not mask out incoming telephone messages be- 
cause of mouthpiece-earpiece feedback (sidetone). 

6. An attempt will be made to keep vhf static noise down 
to 70 db and airborne aircraft noise (primarily jets) down 
to 80 db. 
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History and Development of Stereophonic Sound Recording 


Ross H. Snyper 
Ampex Electric Corporation, Redwood City, California 


The pioneering work on stereophonic reproduction by Dr. Fletcher and the Bell Laboratories 
is described. Consideration is given to the relative importance of frequency range, noise, and 


distortion in monaural vs multichannel reproduction. 


stereophonic reproduction. 


Distinction is made between binaural and 


The realization in practical form of the requirements laid down by 


early research through the use of magnetic tape recording is described. Phase relations are analyzed, 
and requirements for the preservation of three-dimensional effect are defined. 


IHE SEARCH for “facsimile” reproduction of sound 

has gone on for more years than the recent mushroom- 
ing of the high-fidelity industry might at first indicate. 
It is, indeed, startling to realize that facsimile was accom- 
plished as long ago as 1933, although this was under con- 
ditions which would be difficult if not impossible of dupli- 
cation on a commercial scale, even with today’s facilities. 
Facsimile may be said to have been achieved when sub- 
jective judgment by a significant number of persons of 
normal hearing gives no better than chance distribution— 
50% —of correct distinctions between the live, original 
sound and the reproduced sound. 

However acceptable as a substitute for the real thing, 
reproduction which is distinguishable from the original by 
a significant proportion of listeners must be accorded no 
higher rank than high fidelity. 

The requirements for crossing the sharply defined line 
from high fidelity to facsimile were laid down more than 
ten years ago by Dr. Harvey Fletcher.! These were, in 
substance (1) transmission noise significantly below the 
ambient noise in the receiving location, (2) dynamic range 
sufficient to reproduce a level upward of 100 acoustic db? 
peak at the listening chamber, in the case of symphonic 
music, (3) nonlinear distortion due to transmission char- 
acteristics below the level of detectability, (4) frequency 
range equal to or exceeding that of the human hearing 
apparatus, which is limited by the peak listening level, and 
(5) preservation of spatial orientations. 

It is interesting to note in the classical studies which 
emerged from the Bell Laboratories during the late 1920’s 
and 1930’s that ambient noise at the point of listening so 
very commonly exceeds that which is routinely character- 
istic of the auditoriums in which music originates.* In 
the world’s finer auditoriums, it is not uncommon to find 


1 Harvey Fletcher, Hearing, The Determining Factor for High 
Fidelity Transmission, Proc. Inst. Radio Engrs., 30, 266-277 (1942). 

2 Decibels above the reference level 10-16 w/cm?. 

3D. F. Seacord, Room Noise at Subscribers’ Telephone Locations, J. 
Acoust. Soc. Amer., 12, 183-187 (1940); D. F. Hoth, Room Noise 
Spectra at Subscribers’ Telephone Locations, ibid., 12, 499-504 (1940). 


ambient levels as low as 20 acoustic c», and levels of 40 
to 50 are common in the living rooms in which ear is given 
to reproduced music, so that the quieter passages in the 
music may often be expected to be masked by noises en- 
tirely extraneous to the program material or to the trans- 
mission system. 

It is also important to note that in 8 hr of monitoring 
the acoustic levels produced by the Philadelphia Orchestra, 
playing a very wide variety of selections, an acoustic range 
of 65 db was exceeded only twice: once in the tremendous 
crash of sound which occurs toward the end of Stravinsky’s 
Fire Bird, at a point where both the cymbals and the bass 
drum are sounded simultaneously at their maximum loud- 
ness, and once in the midst of an orchestral transcription 
of Debussy’s Clair de Lune, when a long hush comes over 
the whole orchestra. Fletcher estimates that the one great 
crash reaches an acoustic level of 112 db peak, basing his 
estimate upon ‘theoretical development of the evidence con- 
tained in the Sivian, Dunn, and White* papers on the power 
and frequency relationships existing in the output of musi- 
cal instruments, in which momentary intensities of very 
high level were reported producible by these two instru- 
ments. 

The point at which nonlinear distortion becomes per- 
ceptible has been reported in the papers of Dr. Harry 
Olson. Although dependent upon the character of the 
music and of the harmonic structure of the nonlinear dis- 
tortion, it may be taken as somewhat under 1% on a total 
rms basis. Fletcher conclusively demonstrates that the 
frequency range needed for facsimile duplication of sound 
depends firmly upon the levels of listening: when symphonic 
levels do not include the extreme peaks already mentioned, 
but do include the peaks more usually encountered, a range 
of 40 to something short of 15,000 cps is all that the ear 
can detect. This is because the extremes, below 40 and 


4L. J. Sivian, H. K. Dunn, and S. D. White, Absolute Amplitudes 
and Spectra of Certain Musical Instruments and Orchestras, J. Acoust. 
Soc. Amer., 1, 172 (1930); 2, 330-371 (1931). 

5H. F. Olson, Musical Engineering, pp. 345-348, D. Van Nostrand 
Co., New York, 1952. 


176 


f 7 
y y . 
: 
: | i 
Z | =i 
" . ie 
ay 
a ;, 
ie 4 
Oe 7 
ae 
= ee 
wv 
As, 
Po 
ed 
a: 
Dien 
mag ee 
oe 
aims 
is oo OI-?}.\tr—————— 
ie 
a SS §— 
oe 
Ras 
ee 
ay 
ie 
~ 
LA 
ai ' 
@ 
ae 
Be: 
ear 
et 
og 
tae 
Oy 
é ihe 
Me : 
er me ~ 
ee 
Py: 
| . 
pam 
Si ; 
ee 
Ris. 
ec 
7B 
pig 
cy” 
oo 
me 
Og 
dai 
E* 
Al 7 
ma 
wa, 
in 
ioe : 
rs: 
s 
a 
ug 
sak 
rye 
a 
A 
ead 
ant 
ta 
oe 
eae 
‘sak? * 
Py, 
x 
- 
Le 
4 Bs 
is 
hes 
Siu mt 4 " 
tay } 
ie i 
te 
ee i 
bicep | - 
ad 
ag \ 
ats | 
er } 
x: 
fe of } 
eat \ 
“A no 
3 | 
2 ounce 
| ee 
m 
a 
Z | . 
- 
, || | 
‘ 
| 


Fie. 1. A commercial two-channel stereophonic magnetie tape 
recorder, The transport mechanism is in one case; duplicate elec- 
tronie chassis, each with its own controls and volume indicator, 
occupy another case. The third case contains power supplies. 


above 15,000 cps, are audible to persons of normal hear- 
ing only at levels approaching the threshold of pain. 
Fletcher goes so far as to state that “substantially complete” 
reproduction of orchestral music may be accomplished with- 
in the range of 50—8,000 cps. 

The fifth consideration turned out to be crucial: the 
preservation of spatial orientation. However many dupli- 
cate reproducers may be provided as termination for a 
single transmission channel, in order to “spread out” the 
apparent sound source, no important number of observers 
were ever deceived into mistaking the copy for the 
original. In 1933 at the Chicago World’s Fair, however, 
a binaural system, produced under laboratory conditions 
and presented under optimum circumstances, finally achieved 
the desired deception. Carefully fitted ear probes were 
used, of which the listeners could hardly be unaware, but 
the comparison with the live sound was unanimously de- 
clared to be nearly perfect. It remained, in that year, for 
the Bell Laboratories to duplicate the feat with loudspeakers, 
using a system of three rather than two channels, and 
labeled “‘stereophonic.”* The theoretical development for 
the two different systems is reviewed in a Bell System 
monograph of 1934.’ Briefly, binaural and stereophonic 
systems attempt to accomplish somewhat the same ends by 
different means. Binaural transmission assumes the plac- 
ing of a substitute for the human head, containing two 
microphones separated by a sound-absorbent material, at 
some position in the origination chamber at which, it is 
judged, the listener might most like to be. Separate chan- 
nels are then provided for the output of the microphone, all 
the way to the ears of the listener, where the transmission 
is again made audible, still without intermixing the signals; 
this necessitates the use of earphones or, perhaps, standing 
wedged within a soundproof wall, one loudspeaker on each 


6 Elec. Eng., 53, No. 1 (1934) ; six articles. 
7 Bell Telephone System, Monograph B-784, Auditory Perspective, 
Symposium of Six Papers, 1934. 
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side, thus effectively confined to one ear each. 

Stereophonic reproduction proceeds from the theory that, 
if a very large number of microphones were situated in a 
three-dimensional region where sound is originating, each 
with its own transmission system to another location; 
acoustically tdeplical with the originating auditorium, and 
each terminatéd there in an ideal reproducer lécated spatially 
exactly in the same relation to the other reproducers as were 
the original microphones, then a listener at the distant loca- 
tion would experience the same sound sensation as a listener 
at the source. 

The first compromise with this ideal situation is made in 
reducing the number of microphones and channels to a two- 
dimensional plane, so that the whole transmission system 
samples a sort of “curtain” of sound between the audience 
and the orchestra, and the reproducers then reconstitute that 
curtain at the point of listening. Thus all the time and space 
between the origination and audition of the music are made 
transparent for the listener. The second compromise is 
reduction of the two-dimensional curtain to a straight line, 
eliminating those microphones located vertically above the 
others, and eliminating the listener’s ability to distinguish 
if any sounds are meant to originate above the orchestra, a 
compromise which may be considered an acceptable expedi- 
ent in order to reduce the system to practicality. The final 
compromise is that of reducing the number of points on the 
line of origin from indefinitely many, to three, or two per- 
haps, but not to one. Earlier studies have indicated, and 
subsequent experiment has not denied, that the step from 
one channel to two is a major one; and that the step from 
two to three is a major one; but that beyond this point an 
increase in the number of channels is of only marginal sig- 
nificance. The distinction between binaural and stereo- 
phonic in the case of a two-channel system may appear 


Fig. 2. A three-channel stereophonic magnetic tape recorder of 
commercial manufacture. This machine uses %4-in. tape instead of 
the more usual 4-in. width. Successful machines have been built 
which compress three separate channels into the %4-in, width. 
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Fig. 3. The magnetic erase, record, and playback heads of a com- 
mercial two-channel stereophonic tape recorder. The center head 
is actually a stacked pair of record heads; the right head is of the 
same arrangement, but for playback. 


obscure unless the “acoustically transparent curtain” an- 
alogy is kept in mind. Experiments by R. J. Tinkham, in 
cooperation with the University of Illinois,* have resulted 
in a proposal that location of the microphones within the 
theoretical curtain holds the key to successful stereophonic 
duplication, in which the listener at the point of reproduc- 
tion can move about freely within the sound field produced, 
keeping at all times a sense of spatial orientation with rela- 
tion to the unseen orchestra, indeed moving, himself, with 
respect to it while the sound source remains apparently 
stationary. Since the listener wears headphones or is other- 
wise confined in order to remain within the binaural concept, 
the sound source must inevitably appear to move as his head 
moves. Two-channel recordings made with the binaural 
artificial human head device but reproduced by loudspeak- 
ers in a comparatively free sound field, it is reported, pro- 
duce an effect that, although different from the one-channel 
experience, is nevertheless puzzling to the human nervous 
system, which is not conditioned for meaningful interpreta- 
tion of acoustic data presented in this manner. The distinc- 
tion between binaural and stereophonic becomes more readily 
apparent when three channels are presented, of course. 

The early experiments in stereophonic transmission were 
limited to simultaneous origination and audition by the 
character of the then-existing recording systems. 
of preserving the exact phase relations between the two or 
among the three channels were not readily available. Even 
the semistereophonic effects with which the motion picture 
industry experimented in the late 1930’s were made de- 


8R. J. Tinkham, Binaural or Stereophonic?, Audio Eng., 37, 22 
(January 1953). 
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pendent upon careful physical separation of the reproducers, 
with the dominant effect that of amplitude differences among 
the reproducers. The first wholly practical means of re- 
cording separate sound tracks, so that their phase inter- 
relations might be preserved with great precision, appears 
to have been offered with the arrival of magnetic tape re- 
cording systems. Camras, of the Armour Research Foun- 
dation, used this means in experiments on binaural reproduc- 
tion shortly after the war® with notably successful results. 

Although the magnetic tape recording medium offers an 
elegant and simple solution to the problem of preserving 
these phase relations from track to track so long as play- 
back is always on the machine on which the recording was 
made, it is probably well for us to appreciate the degree of 
exactness to which the lateral displacement between the 
two recording or the two reproducing heads must be held, 
from machine to machine, and from which the heads on any 
one machine must not be permitted to depart through their 
service life. If good engineering practices are observed, it 
has been found that the heads may be constructed so that 
the respective gaps are directly above one another, while 
still preserving adequate crosstalk rejection. By this means, 
the twin heads may be constructed as a single unit, and 
their alignment with respect to each other indefinitely as- 
sured. The difference in lateral displacement between the 
heads, from machine to machine, must be held to limits 
which are determined by the degree of exactness with which 


Fic. 4. The degree of precision which is required to line up three 
stacked heads can be judged from this view of a three-channel 
stereophonic head assembly. The stack on the left is for recording; 
that on the right for playback. The machine illustrated was cus- 
tom-engineered to use tape % in. in width, although successful 
machines have been built which place three separate tracks on a 
regular %4-in. tape. 


9 Marvin Camras, A Stereophonic Magnetic Recorder, Proc. Inst. 
Radio Engrs., 37, 440-447 (1949). 
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spatial orientation is to be preserved. It appears that a 
maximum error of 20° may be an acceptable limit to place 
upon this effect; that is to say, if, in playing back on one 
machine a tape made on another, the soloist or the conductor 
may appear to have shifted 20° to the left or right of his 
true position, this error may be regarded as tolerable. The 
human hearing apparatus determines directionality, at least 
in the middle of the audible range, primarily by measuring 
the phase difference between tones due to the difference in 
their time of arrival at the two ears. This is to say, the ear 
measures a time interval. This interval, in the case of tones 
originating a relatively few degrees off dead-ahead, is given 
by the expression 


t = d tan A/v 


where ¢ is the time interval in seconds, d is the distance 
between eardrums (which may be taken as 6 in.), A is the 
angular error, and v is the velocity of sound in inches per 
second. This is approximately 0.165 msec, during which time 
the tape, assuming recording at 15 ips, moves about 2.5 
thousandths of an inch. If, then, the error is not to exceed 
20° in the case of the most extreme combination of differ- 
ences among machines, lateral displacement between each 
of the two heads in both recording and reproducing stacks 
must be held to +£1.25 thousandths. Tolerances of this 


magnitude can be maintained in manufacturing operations 
if extraordinary precautions are observed, so it may be 


assumed that the problem is surmountable. 


Among the important recent experiments with three-chan- 
nel stereophonic recording, those of Lorin D. Grignon, of 


10 Lorin D. Grignon, Experiment in Stereophonic Sound, J. Soc. 
Motion Picture Engrs., 52, 280-292 (1949). 
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Twentieth-Century Fox Studios, are perhaps the best 
documented. Mr. Grignon concludes that stereophonic 
recording eliminates the “boominess” which monaural re- 
cording unnaturally represents some studios to have. This 
reverberation effect, upon which quantitative data are largely 
lacking, is also reported greatly reduced with binaural re- 
cording by David C. Apps of General Motors."' Grignon 
further concludes that phase relation is the largest factor 
in the listeners’ determination of sound directionality, with 
the gain through each channel a secondary factor. Our 
experiments in large measure confirm this, although we find 
indications that phase relations are most important in the 
middle range, while amplitude effects become more and more 
important in determining the source of hf sounds. Another 
of Grignon’s conclusions is that one microphone must be 
placed near any soloist, if clear and definite apparent loca- 
tion is to be preserved, but that there must at all times be 
some pickup on all three channels from both the soloist and 
the accompaniment. 


It may be said, in conclusion, that the ready preserva- 
tion of stereophonic effects has been made economically and 
physically possible by the advent of modern multitrack 
magnetic tape recorders, that the realization of these effects 
may be expected to reach the conditions of facsimile repro- 
duction, if known and attainable considerations are met, 
and, finally, that the applications of stereophonic recording 
are by no means limited to the preservation of music, but 
may be expected to have profound effect upon industrial 
sound analysis, and even upon physical investigations into 
the nature of sound itself. 


11 David C. Apps, Use of Binaural Tape Recording in Automotive 
Noise Problems, J. Acoust. Soc. Amer., 24, 660 (1952). 


ro y 
od ¥ 
oF 
ae 
ag 

‘ 
ee —“C™CSCSC‘(CSCiéSC a 
Pe. 
22) 
cn 
a 
= 
. 
rg 
| a 4 
i. 
i. 
: a 
ow 
a 
: Oe 
4 
a } 
\ -— 
re an a 
r Pas 
: — 
; . oe 
: - 
fi - 
4 d 
* pa 
a ‘a 
ri aa 
“5 i 
k * 
ia : 
ig . 
Mi 7 
| 
? ps 
= 
ve 
+a 


JOURNAL OF THE AUDIO ENGINEERING SOCIETY 


APRIL 1953, VOLUME 1, NUMBER 2 


An Adventure in Microphone Design 


Howarp T. SouTHER 
Electro-Voice, Incorporated, Buchanan, Michigan 


7 IMPROVEMENTS in loudspeakers and comple- 

mentary reproducing components in recent years have 
done much to point up the deficiencies in microphones. In 
the past, one of the most popular microphones has been the 
ribbon type because it can easily be adapted to unidirec- 
tional designs. But inefficient hf response, boominess in the 
high bass region caused by the proximity effect, and low 
sensitivity in spite of the large magnet size in the generating 
element have gradually tended to force into disrepute the 
use of this type of transducer. 

In television work, ribbon transducers reveal high sensi- 
tivity to shock when employed on booms. The repeated 
failure of the fragile ribbon element suggests the need for a 
more rugged and lighter microphone. The wide range 
possible with FM audio presupposes a microphone with the 
best hf response obtainable, and also an extension of the 
bass range, provided that it is clean and free from boominess. 

The ribbon microphone fails on almost every count. Con- 
siderable advantage would accrue if a microphone were 
available which weighed ounces instead of pounds, was 
small and unobtrusive, exceeded the ribbon microphone in 
sensitivity, attained much wider range, and was more rugged 
and less subject to shock and air noise. 

A microphone that meets these requirements has recently 
become commercially available. This is the Model 655 
“Slim-Trim” unit (Fig. 1). Operating as a pressure de- 
vice, the pattern is mildly directional. Although a ribbon 
microphone of the unidirectional type has distinct advan- 
tages under high noise conditions, the flexibility of this new 
microphone and the ease with which it meets most of the 
difficult conditions of television offset the single advantage 
of the ribbon transducer to a large degree. 

The design of the Model 655 presented many interesting 
problems, the solutions to which lay in the exploiting of four 
principal acoustical phenomena. These phenomena were 
susceptible to optimum use in a pressure microphone of the 
moving-coil type. In order to explain more easily the de- 
velopment of the Model 655 transducer, a digression is in 
order on the general theory of operation, assisted by a de- 
scription of several mechanical, electrical, and acoustical 
analogies. 


GENERAL THEORY OF THE MOVING-COIL 
PRESSURE-TYPE TRANSDUCER 


One of the most basic concepts of a microphone, perhaps, 
is that of a coil of wire attached to a supported diaphragm 
operating in a magnetic field and enclosed by a cavity on 


655 RIBBON TYPE 


Fig. 1. Relative size and weight of the Model 655 and the ribbon- 
type microphone. 


one side (Fig. 2). The response of the microphone to wave 
motion in the air will assume the character of the curve 
shown. This is known as a pressure microphone, because it 
responds to the pressure of the air upon its surface, reflect- 
ing some, dissipating a small portion of the energy into 
heat, and translating a certain amount into motion. That 
portion which is used to cause motion we can employ after 
the manner of an electrical generator and thus feed an 
electrical system for various purposes, such as amplification, 
recording, and reproducing. 

The virtues of this system are the extreme ruggedness, 
the relatively high output level for a given size and weight, 
and the fact that it adapts itself to the processes described 
later which promote practically flat response over the audible 
range. 

The value of the generated voltage can be calculated from 
the simple formula for a moving-coil system: 

Ey = BL(F/G) 
where Eo = generated voltage in abvolts. 
B = gap flux density in gauss. 
L = length of conductor in centimeters. 
F = force in dynes. 
G = mechanical impedance in dynes per centimeter 
per second. 
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7 


Fic. 2. F = AP, where F = force, A = area of the diaphragm, 
and P — pressure. 


The gap flux density, B, and the length of the conductor, 
L, are independent of frequency. The response character- 
istic of the microphone will be obtained by evaluating the 
force, F, and the mechanical impedance, G, as a function 
of the frequency. Observe the diagram of Fig. 2. Where 
the diameter dimensions of the case are small compared 
to the wavelength, the force exerted on the diaphragm is 
equal to the area of the diaphragm, A, times the pressure, 
P. The effect of the case dimensions upon the shorter wave- 
lengths will be explained later. 


OM 
SPRING 


MECHANICAL 
COO’ 
(COMPLIANCE) <=F 


Ce "d 
ELECTRICAL 
a ‘a 


ACQUSTICAL 


EXAMPLE " OPEN re 


DIAPHRAGM 0<a MICROPHONE 
COMPLIANCE CASE 


Fic. 3. Mechanical, electrical, and acoustical analogies. In a 
mechanical or acoustical system, C must have one side grounded. 


MECHANICAL, ELECTRICAL, AND ACOUSTICAL 
ANALOGIES 


Figure 3 presents a useful tool for the solution of acousti- 
cal problems. These analogies permit the various factors 
involved in the design of acoustical or mechanical systems 
to be translated into electrical equivalents. In most of the 
explanations that follow, a combination of mechanical and 
acoustical symbols is used to make the text clear by com- 
paring their use in the equivalent electrical circuit. Several 
explanations listed below will assist in the interpretation of 
the symbols shown in the figure. 


Mass is the mechanical element which opposes a change 
in velocity. 
Inductance is the electrical element which opposes a 
change in current. 
Resistance is the electrical dissipative element. 
Friction is the mechanical dissipative element. 
Capacitance is the electrical element which opposes a 
change in the applied force. 
Compliance is the mechanical element which opposes a 
change in the applied force. 
It should be noted that in a mechanical or acoustical 
system the capacitive element always must have one side 
grounded. 


PRESSURE MICROPHONE OPERATION 


Figure 4 shows the operation of a simple pressure-operated 
microphone of the moving-coil type. The electrical equiva- 
lent circuit of the mechanical system shown at the top of 
the drawing reveals that the compliance of the volume, C,, 
the mass of the diaphragm, M,, and compliance of the 
diaphragm, C,, form a resonant circuit, the response-fre- 
quency characteristic of which is shown in the graph. The 
height of the peak and the broadness of the base are directly 
dependent on the Q of the circuit. A simple pressure micro- 
phone of this type produces a highly sensitive unit, but with 
violently peaked response somewhere in the region between 
500 and 1,000 cps. Starting with this simple system, we 
shall now expose the methods by which a virtually flat pres- 
sure-operated microphone of the moving-coil type may be 
designed. 
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Fic. 4. Pressure microphone operation. 
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Fig. 5. Step 1: Extending the range. 


EXTENDING THE RANGE 


Step. 1. In electrical terms, the resistance added to the 
equivalent circuit, shown in Fig. 5, will lower the Q of the 
system and eliminate the previously experienced violent peak 
in the response-frequency characteristic. Referring to the 
table of analogies, we find that in the acoustical system this 
resistance can take the form of a piece of felt, and such an 
acoustical means is quite frequently used to provide the 
necessary damping for smooth response. All things con- 
sidered, we now have a thoroughly usable microphone with 
reasonably good range, similar to that acceptable for public 
address work. The lf range still leaves something to be 
desired, and the sensitivity in the high end of the spectrum 
is not considered the ultimate. 


IMPROVING THE BASS RANGE—USE OF BASS REFLEX 


Step 2. Better lf response can be obtained from the 
system if we now introduce a mass, M,, in the design (Fig. 
6). This element is in parallel with the resonant circuit 
formed by the resistance of the microphone introduced by 
the felt, R, the compliance of the diaphragm, C,, and the 
mass of the diaphragm, M,. The tube is connected from 


the outside of the case to the enclosed volume, C,. Once 
the mass and compliance of the diaphragm itself are fixed, 
the If range will be determined by the values assigned to 
the case volume and the air mass of the tube. If the case 
compliance, C,, is made too low, it will not be possible to 
improve the bass range, regardless of the tube dimensions 
chosen. 


In general, for the optimum size of this micro- 
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phone, the case volume has been determined to be no less 
than 4 in.®. It is interesting to note that the bass reflex 
loudspeaker cabinets derived their operation from the 
original employment some years ago of the principle just 
discussed in the design of microphones. Observe one phe- 
nomenon caused by the introduction of this bass reflex 
principle: Although we have improved the bass response 
efficiency to an extent, beyond a certain point we have 
caused a much more rapid attenuation of the bass range. 
Originally our attenuation slope was 6 db per octave, whereas 
now we find that the “doublet” effect introduced by the 
tube opening results in a slope of 12 db per octave. There 
is nothing more that we can do, practically, to extend the 
bass range and efficiency past this point. By careful ad- 
justment of the acoustic resistance of the felt material used 
to introduce damping, the mechanical impedance, G, can 
be made substantially constant with respect to frequency. 
We now have a microphone with good bass response, but 
the deficiency of hf response results in a displeasing quality. 


EXTENSION OF THE MID-HIGH RANGE 


Step 3. The shape of the microphone has a decided bear- 
ing on the hf range response. For years there has been 
considerable academic discussion on the diffraction of a 
sound field by an obstacle. Lord Rayleigh’ met with some 
practical success in the derivation of a theoretical expres- 
sion for diffraction in the case of a spherical object. But it 
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Fig. 6. Step 2: Use of bass reflex. 


1 Rayleigh, Theory of Sound, Vol. 2, p. 2, paper No. 287; Vol. 5, 
p. 112, paper No. 229; Vol. 5, p. 149. 
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6" DIA. CYLINDER 


— 


Fic. 7. Effect predicted by Muller, Black, and Davis. 
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Fic. 8. Step 3: Increase of mid-highs. 


was not until 1937 that Muller, Black, and Davis* derived 
practical theories applicable to semi-infinite planes, very 
long cylinders, and paraboloids of practical dimensions. The 
general solutions for diffraction caused by obstacles, par- 
ticularly of spherical and tubular shapes, have been set 
forth by these authors. Briefly, what happens is this: 

For a cylinder 6 in. in diameter and 6 in. in length, it 
has been determined that the response-frequency character- 
istic will assume closely the curve shown in Fig. 7. Observe 
that sound approaching on the axis is subject to reinforce- 
ment of approximately 10 db at 2 kc. For this cylindrical 
shape, there follows a series of reinforcements as the fre- 
quency increases. 

In line with this principle, a series of experiments reveals 
that a reduction in diameter by one-half results in a first 


2G. G. Muller, R. Black, and T. E. Davis, The Diffraction Produced 
by Cylindrical and Cubical Obstacles, and by Circular and Square 
Plates, J. Acoust. Soc. Amer., 10, 6ff (1938). 


reinforcement peak at double the first frequency, or 4 kc. 

A microphone 1 in. in diameter would seem to provide 
a happy solution to the requirement for small size, at least 
in one dimension. Accordingly, tests were conducted on a 
l-in. cylinder, and the results of these tests conformed 
closely to those predicted by the original diffraction experi- 
ments. The requirement to include 4 in.’ in the back cavity 
dictated a certain length to the cylinder which influenced 
the magnitude of the original size predictions. The exten- 
sion of the mid-high range was satisfactory to the 12-kc 
point as shown in the curve on Fig. 8. 

At this stage of development, the Model 655 compared 
most favorably with high-quality microphones available 
commercially. But in order to exceed the performance 
of existing microphones by a marked degree, further work 
was required. This will be evolved in step 4 to follow. 


REINFORCEMENT OF THE VERY HIGH FREQUENCIES 
WITH A FRONTAL RESONATOR 


Step 4. Observe in Fig. 8 that the attenuation rate past 
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Fig. 9. Step 4: Vhf reinforcement with frontal resonator. 
| anti 
{| 
|| 


| 


' 
0 iT 


| 
| 


Fig. 10. Typical response of the Model 655. 
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Fig. 11. Anechoiec sound chamber. 


12 ke appears to be about 6 db per octave. It is desirable 
to make the microphone flat to 15 kc and thus encompass 
the entire audible range in the hf direction. A cavity and 
tube on the front of the Model 655 diaphragm assembly, 
concealed in the grillwork, form the inductive and capacitive 
elements shown as C,, and M, in Fig. 9. These represent 
the cavity of the grill volume and the mass introduced by 
the grill configuration, respectively. Thus, a resonant circuit 
is formed which produces an increase in hf response which 
is essentially flat to 15 kc as shown. Past the 15-kc point, 
the attenuation is rather rapid, at a rate of approximately 
18 db per octave. 

The application of the four acoustic principles here de- 
scribed has resulted in the production of a microphone whose 
response is flat within 2 db from 40 to 15,000 cps, as pre- 
sented in Fig. 10. It may be reasonably inferred that 
practically perfect response has been achieved in this unit. 
Response from 20 to 40 cps under usual conditions of 
operation is quite likely undesirable because of extraneous 
If noise communicated through stage walls, electrical over- 
loads due to closing large doors, and other low-end hf energy 
communicating little or no program material. 


RUGGEDNESS AND RELIABILITY 


Of special importance in the design of the Model 655 is 
its extreme ruggedness. Its unusual ability to withstand 
shock and rough handling is due principally to the use of 
a nonhygroscopic specially treated, cast plastic of high 
tensile strength. Tangential compliances are formed in the 
mold to effect a complete control over the compliance char- 
acteristic with a minimum of space. This diaphragm ma- 
terial is commercially termed Acoustalloy and has very high 
tensile strength. The manufacturing process used in the 
forming of the diaphragm is carefully controlled, and the 
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tolerances on thickness are kept within one ten-thousandth 
of an inch. Nominal thickness is 1.3 thousandths in order 
to present the most favorable mechanical compliance for 
this element, in conjunction with the other elements of the 
system, to produce the best bass range. The microphone 
can be dropped on the floor without damage and completely 
submerged in water without impairing subsequent operation. 

The high resonant frequency employed in the moving 
system of about 500 cps makes the microphone resistant to 
If microphonic effects, thus eliminating in practical use the 
need for shock mounts. 


QUALITY CONTROL IN PRODUCTION 


During the development of Model 655, an anechoic sound 
chamber, shown in Fig. 11, was employed for frequent tests. 
A standard calibrated Model 640-AA microphone, shown in 
the illustration, was used for frequent calibration of the test 
electrical system. 

An unusual test procedure for production can be observed 
in Fig. 12. The equipment pictured consists of a somewhat 
smaller anechoic chamber than that used for laboratory 
work. A standard Model 655 is used in conjunction with 
the production unit inside the chamber, and a comparative 
decibel meter reads the log of the difference between the 
standard and the unit under test. Tolerances of % db are 
allowed on production. The microphones are also given an 
actual voice test, at which time cable connections and me- 
chanical rattles are screened. 


SUBJECTIVE LISTENING TESTS 


A model recording studio in which microphone develop- 
ments are conducted allows studies under actual field con- 
ditions. A monitor room with an eight-position console 


Fig. 12. Test procedure for the Model 655. 
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Po *Eo | ST ORDER 
Pez =Eo 2ND ORDER 
Fig. 13. Pressure gradient and higher-order gradient operation. 


will simulate actual conditions of use and permit the de- 
velopment engineers to record their tests and play them 
back at will for careful analysis. 

In conclusion, it may be pointed out that the test of the 
success of any design is in the public acceptance achieved. 


TREND OF MICROPHONE DEVELOPMENT IN THE 
FUTURE 


The direction for future microphone developments is indi- 
cated by the need for microphones similar in characteristics 
to the Model 655 but with the addition of the unidirectional 
feature. 


These future microphones will probably work on the pres- 
sure gradient of the first order and on higher-order gradi- 
ents. With reference to the upper half of Fig. 13 for the 
vector diagram of first-order gradient operation, P, is the 
pressure on the front of the element, and P, the pressure, 
different in phase, on the rear of the element. The resultant 
pressure on the element is shown as P,,;. Any sound whose 
wavelength is greater than the length of path from the front 
to back openings will be differentiated. Since the attenua- 
tion is proportional to wavelength, discrimination takes 
place at the rate of 6 db per octave. Therefore, to achieve 
complete discrimination, the microphone must be designed 
to cut off frequencies shorter than this critical wavelength. 

Since the ability to discriminate direction is the criterion 
of good unidirectional design, a more effective microphone 
of this type is achieved with differentiation of 12 db per 
octave by juxtaposition of a second element as shown. The 
dotted lower half of the vector diagram indicates the effect 
of this addition. The second-order differentiation is the 
lower pressure P,2. It is obvious that gradients of still 
higher order will permit smaller elements and provide in- 
creasingly effective discrimination at higher frequencies. 

Considerable work is now being conducted along these 
lines, and a more flexible transducer for industry will be 
produced in the near future. 
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Modern Methods of Filter Design 


APRIL 1953, VOLUME 1, NUMBER 2 


Lestre Norpe* 
Hammarlund Manufacturing Company, New York, New York 


A PRACTICAL PROBLEM which frequently arises 
during the design of communication systems is the 
design of filters. Usually, filters are used to permit the flow 
of electrical energy from a source to a load for some band 
of frequencies and to prevent the flow of energy at other fre- 
quencies. In spite of the great practical importance of this 
problem, the only commonly used methods of filter design 
today are based upon constant-K and m-derived sections. 
Although the constant-K and m-derived filters have been 
found satisfactory for many years in the past, they have 
several faults which can be overcome by more modern 
methods of design. The two most common faults are the 
following. . 

1. The exact frequency response of the filter is usually 
not known until after the design is complete and the response 
is actually calculated. This may result in considerable work, 
especially if the design must be altered on the basis of the 
first set of calculations. 

2. The finished filter has in general more elements than 
are absolutely necessary to obtain a desired frequency re- 
sponse. This results in an unnecessary increase in filter size 
and in price over filters designed by the method outlined 
below. 

To illustrate, let the response of a simple constant-K filter 
be compared with that of a filter designed to approximate a 
rectangular pass band. The two filters and their frequency 
responses are illustrated by Figs. 1 and 2. 

A quick examination of these figures shows that the “flat- 
top” filter is flatter within the bandpass and attenuates more 
outside the bandpass than a constant-K filter having the 
same number of elements. Consequently, if the response of 
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Fig. 1. Constant-K filter. 
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Fic. 2. Flat-top filter. 


the flat-top is to be approximated by a constant-K filter, 
more filter elements are needed. Furthermore, the response 
of the flat-top can be obtained readily from one of a set of 
exact formulas, whereas the exact response of a multisection 
constant-K filter which produces equivalent insertion loss 
must be calculated point by point. 

The technique to be described below will outline how to 
design a class of filters which: 

1. Have exactly known insertion loss characteristics, such 
that a particular frequency response can be picked before 
the filter elements are known. The filter elements are then 
chosen to have the selected frequency response. 

2. Have the smallest possible number of elements to 
satisfy the selected response. 

To present this subject in an essentially nonmathematical 
form, it is necessary to restrict it to a few special cases. 
Fortunately, even with many omissions, it is possible to 
present some basic principles and to arrive at methods of 
designing some types of filters which have considerable 
practical utility. 

In the interests of simplicity and clarity, therefore, the 
subject of design will be restricted to frequency responses 
possible with simple networks consisting of purely reactive 
x and T sections, having resistance terminations. The 
choice of z and T sections was made because they are al- 
ready familiar to those who have studied constant-K and 
m-derived filter types. As will become evident below, how- 
ever, any arbitrary “lossless” network which satisfies certain 
well-defined conditions can be chosen as a filter configuration. 
Furthermore, although it is possible to exactly design filters 
which have elements having finite Q’s (i.e., the filter is not 
lossless), this procedure is more involved and hence will not 
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Fig. 3. Filter network between generator and load. 
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be considered in this paper.’ For all practical purposes, 
reasonably high Q’s possible with modern toroidal inductors 
and capacitors of good grade will yield a filter with a re- 
sponse very close to the response possible with filter elements 
of infinite Q’s. 

The design of filters can be made very convenient by first 
designing a low-pass prototype which satisfies the desired 
frequency-response conditions and then transforming the 
prototype to a bandpass, high-pass, or band-suppression 
filter as required. The low-pass prototype is usually de- 
signed for unit generator and load resistances and a normal- 
ized frequency scale, and the filter impedance and frequency 
scale are transformed to values which are required by the 
actual specified conditions. Designing filters in this way 
keeps the numerical values convenient and in general sim- 
plifies the necessary computations, thus reducing the prob- 
ability of numerical or analytical errors. The normalized 
frequency scale is simply a scale in which the cutoff radian 
frequency (2zxf) is unity and all other frequencies are re- 
ferred to it. The process of transforming the filter to the 
practical requirements of frequency and impedance will be 
shown below. For the first part of the subject, therefore, 
the low-pass prototype will be considered. However, all 
statements about the prototype are applicable to any other 
filters into which the prototype can be transformed. 

For the purpose of defining filter performance, let a filter 
be represented by a black box having four terminals, as 
illustrated by Fig. 3. 

The behavior of this system is completely defined by two 
simultaneous equations.’ 

E= Zi), -Zial2 (1) 
O = —Zy21; + Zale (2) 
where 

Zi; = open-circuit driving-point input impedance seen by 

the generator with output circuit opened at B. 

Zo = open-circuit driving-point impedance seen at load 

end, with generator disconnected at A and im- 
pedance measured looking in at open-circuited B. 
Zi2 = mutual impedance between generator and load. 


1 Milton Dishal, Design of Dissipative Band Pass Filters Producing 
Desired Exact Amplitude-Frequency Characteristics, Proc. Inst. 
Radio Engrs., 37, 1050-1069 (1949). 

2H. W. Bode, Network Analysis and Feedback Amplifier Design, 
pp. 4-10, D. Van Nostrand Co., New York, 1945. 


To illustrate further, consider a simple T circuit connected 
to a generator and load (Fig. 4). Let the inductive and 
capacitive reactances be defined by eqs. 3 and 4. 


pL = Inductive reactance (3) 


As = Capacitive reactance (4) 
pc 
For convenience and clatity, p is used here to represent a 
normalized complex radian frequency, in place of the com- 
monly known jw. As will be shown below, the response of 
the filter to real frequencies is determined by substituting 
jo for p in the various expressions of frequency response. By 
the use of the complex frequency / the various impedance 
expressions in the following analysis become simple, and at 
the same time no generality is lost in the various derivations 
to follow. The normalized complex frequency can be repre- 
sented as 


p=a+jxe (j=Y-1) 


where a is any real number corresponding to an abscissa, 
and x is the normalized real radian frequency 27f (analogous 
to ») corresponding to an ordinate. Any complex frequency 
p can be represented, therefore, as a point in a plane having 
a and jx as coordinates. 

For the network illustrated, 


1 
21: = R, + pL + — 
pc 


If eqs. 1 and 2 are solved simultaneously, an expression for 
the load current J. can be found. 

EZ;2 

a 
Z1:Z22 —- 212" 


If the transfer impedance Z, is defined as Z,; = E/J», then 
in terms of the circuit parameters 


_ £222 — 212" 
Z12 


Z; (5) 


The transfer impedance Z; is a very useful filter property, 


Fic. 4. Typical low-pass cireuit. 
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because it relates the load current /2 to the applied generator 
voltage E, as in eq. 6. 
E 
Zz; 
With /. and the load resistance R; known, the power ab- 
sorbed by the load is | /2 | Rz, where | J2| is the absolute 
value of load current. This is the current which an ammeter 
would read in series with R,,. 


In terms of the applied voltage EZ, the power output into 
the load is expressed by eq. 7. 


b= 


(6) 


Ee 
|Z, |? 


Picaa = (7) 

Equation 7 expresses the power dissipated in the load 
for any generator voltage. To specify the filter response 
completely, it is necessary to relate the output power in 
some way to the power delivered by the generator. If R, 
is the internal impedance of the generator, then maximum 
power can be drawn from the generator when the input 
impedance of the filter equals R,. Under this condition, 


| al 


Poax — 


(8) 


Since the filter itself is assumed to be lossless, however, all 
this input power must be dissipated in Rz. Therefore 
Pax — Maximum power available in load 


By dividing eq. 7 by eq. 8, a transmission function of 
power is defined, as indicated by eq. 9. 


_ Proaa 4 RyRi 

Puox | Zz; |? 
where | Z;| is the absolute value of transfer impedance. 
For a normalized filter network R;, = R, — 1, by defini- 


tion. Substituting this in eq. 9, a normalized transmission 
function is found. 


(9) 


ie (10) 
|Z; |? 


Equation 9 or 10 implies certain restrictions on the trans- 
mission functions possible with any passive bilateral net- 
work. It is obvious that G must lie between 0 and 1; other- 
wise the law of conservation of energy is violated. Further- 
more, G must be an “even function” of real frequency; i.e., 
G must be symmetrical about x equal to zero. These restric- 
‘tions can be formally expressed by eqs. 11 and 12. 


0<G<l (11) 
G(x) = G(-*) (12) 


‘where G(x) means that G is a function of normalized real 
frequency. 


ENGINEERING SOCIETY 


Before proceeding further, it is necessary to examine the J 


various forms the transfer impedance can take for any net- 
work. It can be shown that any impedance expression of a 


linear network consisting of a finite number of elements has | 


the form of a rational fraction, as illustrated by eq. 13. 
P™ + ani p™*? + °° * +4 


Z(p) =A (13) | 


pm + bmp” ao eee + bo 
The form illustrated by eq. 13 is the same, regardless of 


what impedance expression is written. In particular, it is ] 


correct for the transfer impedance expression. For example, 


the transfer impedance for the T circuit of Fig. 2 calculated } 


from eqs. 3, 4, and 5 is shown below for R, = Ry = R. 


soe fon(B) eB) 


Comparison of this expression with eq. 13 shows that 
A=fC 


bb = 1 
and all other 5’s are zero. 

The general expression for impedance can be used to cal- 
culate the response of the impedance to real frequencies by 
letting the generalized complex frequency p — jx. When this 
substitution is made, all the even powers of (jx) become real 


numbers, and all odd powers of (jx) become imaginary num- | 
bers. If the complex fraction is rationalized, and if all the | 


real terms are collected and are equal to R(x) and all the 
imaginary terms are collected and are equal to X(x), then 
the general impedance function Z(p) becomes 

Z(jx) = R(x) + jX(x) (14) 
Equation 14 expresses the form taken by any impedance 
expression for real frequencies. 


It should be further noted that, since R(x) contains all | 


even powers of x, R(x) is an even function; since X(x) 


contains only odd powers of x, it is an odd function. These | 


facts are expressed analytically by 

R(x) = R(-*) (15) 

X(x) = -—X(-«) (16) 
To illustrate this more clearly, for the T network of Fig. 4, 
the value of Z; for real frequencies takes the form 

Z, = A(do— ax") + jAx(a; — x*) 

It is seen that the real and imaginary parts satisfy eqs. 15 
and 16. These relationships, which define the form any 
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impedance function takes, are very useful as an analytical 
check on expressions of impedances obtained during design; 
if at any time the form of impedance obtained does not 
satisfy eqs. 15 and 16, it is certain that an algebraic error 
was committed during the derivation of the given impedance 
expression. 

The general expression of impedance as shown by eq. 13 
is a quotient of two polynomials. The polynomials are in 
powers of » and have a finite number of terms. From the 
theory of equations it is known that the numerator and the 
denominator can be factored into m and m terms, respec- 
tively. Letting the numerator be N(p) and the denominator 
be D(p), these can be expressed as a product of factors: 


N(p) = (b- fo) (b- pi) (P-2) * * * (P-bn) 


and 
D(p) = (p-10) (P-11) (P-12) * * * (D-Tm) 


where po, pi, p2*** p, are the n roots of N(p) = 0, also 
known as zeros of Z(p), and ro, 71, 2 * * * 7m are the m roots 
of D(p) = 0, also known as poles of Z(p). 

The impedance expression (eq. 13) can therefore be ex- 
pressed as a product, 


Z(p) = A P= Po) (P- Pr) (p — Pn) 

(p—1ro) (p-11) °° * (P-tm) 

The values of the roots of such a polynomial in general 

can be zero, real and positive, real and negative, or can occur 
as conjugate complex pairs. 

If the impedance expression is to correspond to a physi- 


(17) 
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|Z, |2 = LP Ped (P— Po) (P+ Po) (P + Po) °° 


all have negative real components. 

Equation 10 defines the relation between a transmission 
function and the reciprocal of the squared absolute value of 
a transfer impedance. The transmission function as defined 
by eq. 9 in terms of the maximum power available from a 
generator of internal impedance R, and the power delivered 
into a load R, is usually obtained from the specifications of 
the desired filter, moderated by the limitations indicated by 
eqs. 11 and 12. Before any network can be found which has 
the required transter impedance Z;,, it is necessary to find a 
general expression Z; corresponding to the squared absolute 
value of the transfer impedance; to restate in different words, 
given | Z, |*, what is the value of Z;? 

The relation between Z, and | Z; |? can be found by re- 
calling the facts that G must be an even function of fre- 
quency and that Z, must have conjugate complex roots. 
Using the general complex frequency p instead of the real 
frequency x as the independent variable, these facts can be 
restated formally as 


G(p) =G(-?) (12a) 

|Z:(p) |? = | Z:(—p) /? (12b) 

Equation 12b implies the fact that if a complex number 

r, is a root of the denominator (or numerator) of Z;, then 

—t; is also a root. Consequently | Z, |* will have roots of 

the form (—a + jx), (—a— jx) and (a+ jx), (a— jx). 

Denoting these possible roots as p,, p, and —hr, —pr for 

the numerator and r;, 7, and —r,, —r, for the denominator, 

the expression for the squared absolute value of transfer 
(or any other) impedance can be put in the form: 


* (p— pn) (P— Pn) (P + Pn) (DP + Pn) 


(p—ro) (p—10) (P+ 10) (P+ 10) °° 


cally realizable network, the values such roots can have must 
be restricted.* For the transfer impedance considered in 
this paper, the “natural mode” of the network is in the 
numerator as shown by eq. 5. The natural behavior of the 
transfer impedance is defined by Z;;Z22 — Z2*. The roots 
of the equation Z;;Z22 — Z,.? — O must have negative real 
components if the law of conservation of energy is to hold. 
This becomes apparent when the transient behavior of the 
network is considered. If the network is excited by a single 
pulse, the transient currents induced eventually reach zero, 
since energy is dissipated in the terminating resistances. 
Consequently, if Z(p) of eq. 17 represents a transfer im- 
pedance as defined by eq. 5, the roots in its numerator must 


3 See H. D. Bode, Network Analysis and Feedback Amplifier Design, 
pp. 226-233, also Chapter 7, D. Van Nostrand Co., New York, 1945. 


Roots of | Z; |? numerator with (—) real parts 


. (18) 
z (p—Tm) (p—Tm) (p + Tm) (p + Tm) 
neglecting scale factors. From eq. 18 it is apparent that the 
denominator of | Z, |* contains two sets of factors, one set of 
factors containing real or complex conjugate roots with 
negative real components, and another set of factors con- 
taining real or complex conjugate roots with positive real 
components. The factors of the numerator can similarly 
be grouped. 
Let Z, = rational fraction containing zeros with positive 
real components. 
Z. = rational fraction containing zeros with negative 
real components. 


Then iZ,.f° =2,x €. 


The value of Z; therefore must be Z., if it is to represent 


a transfer impedance corresponding to a physically possible 
network. 


Roots of | Z, |? denominator with (+-) or (—-) real parts 


(19) 
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Equation 19 relates the transfer impedance Z, to | Z; |’, 
thus defining a relation between the transmission function 
G and the transfer impedance of a network, except for a 
possible scale factor. (The denominator of Z; contains no 
“natural modes”; therefore its roots may have positive or 
negative real components, as long as complex roots occur in 
conjugate pairs. ) 

Having found Z,; from the transmission function obviously 
does not directly yield an actual network. Consequently, 
the last step in the design of the filter is to find a network 
whose transfer impedance is Z, corresponding to the desired 
transmission function. There are several analytical methods 
by which a network can be deduced if the transfer impedance 
of the network is known; however, all such methods require 
a mathematical background which cannot be profitably laid 
down within the scope of this paper. Since the responses 
considered were originally restricted to those possible with 
simple + and T networks, for the purpose of the present 
discussion it is sufficient therefore to assume a network con- 
sisting of x or T sections and write the transfer impedance 
of the network in terms of network parameters of R, L, and 
C. When a transfer impedance is found from such a network 
which is identical to the desired Z,, except for a scale factor, 
the value of the network parameters can be found by 
equating the coefficients of like powers of the frequency 
variable and solving the equations so formed. 

Before proceeding with an illustrative example, the design 
steps are briefly summarized below for ready reference. 

1. From the specification of the desired network response, 
set up an analytical expression of transmission function G 
which meets the specified requirements and satisfies 

0<G=l1 (11) 
G(x) = G(-x) (12) 

2. From eqs. 10 and 19 find the required network transfer 
impedance Z,. 

3. Assume a suitable network and write its transfer im- 
pedance Z,’. 

4. From the identity, neglecting scale factors, 

Z,2Z; 
form equations by equating the coefficients of like powers. 

5. From these equations, solve for numerical values of 
the network elements. 

Although the design process summarized by these five 
steps was developed on the basis of a transfer impedance of 
a lossless filter, it is possible by an almost identical develop- 
ment to arrive at a design procedure based on the input im- 
pedance of a filter. Furthermore, for either input or transfer 
impedance a dual process can be reached, based on the input 
or transfer admittance of networks. 

To illustrate the general process of network approxima- 
tion, a practical problem is presented below. In this prob- 
lem it is assumed that the network is low-pass, normalized, 


and that the desired frequency response is specified by prac- § 
tical considerations. Suppose that the frequency response § 
of the network is defined as follows: 


Frequency, 
radians Power response 
0 0 db 
1 3 max db down 
2 15 min db down 
3 25 min db down 


The relatively simple transmission function, 


G = 1/(1 + 2) 


will produce the following response: 


Power response, 


Frequency (7) db 
0 0 
1 - 3 
2 -18 
3 —28.7 


Since the response of G is satisfactory and G meets the 
requirements of eqs. 11 and 12, it will be used. 

From eq. 10: 

[Z,[?= 1+ p* 

using p as the frequency variable and neglecting the scale 
factor 4. Solving 1 + p* = O to obtain the factors of 
|Z; |, it is found that the roots are the 6 sixth roots of —1. 
These roots are: 
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ps=1 
The roots shown on the complex plane (Fig. 5) lie on the 
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Fie. 5. Location of complex roots of 1+ p*=— 0. 
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tips of a regular hexagon inscribed inside a circle of unit 
radius. 

Since all the roots are contained in the numerator of 
| Z; |*, only those which lie to the left of the imaginary axis 
can be used. Therefore 


Z, = (p- 1) (p— pr) (Pp - fz) 
Multiplying out, 


Zi = P-P?(Pit Pit bs) +P [Pidi+ po(pit pr) |-PiPipe 

Substituting the numerical values of the three roots se- 
lected, the final expression for Z; becomes 
Z,= p+ 2p? + 2p+1 

The normalized transfer impedance of the T network 


considered before has the identical form, except for the scale 
factor L°C. 


Z/=Lc\pP+ (| — ) - — 
Ptr = oe aes 


Equating coefficients of Z,; and Z,’, 


and 


Since only LZ and C are unknown, any two of these equa- 
tions can be solved simultaneously. From the first and last, 


L=1 6.22 

The required filter having the power response G in nor- 
malized form is shown by Fig. 6. 

To check on the actual response of this filter, its response 
to real frequencies will be found by substituting p = jx in 
the expression for Z,’. 

Z, == 2(1 — 2x") + 2jx(2 -— 2x") 
|Z,’ |? = 4(1 — 2x)? — 4x?(2 — x*)? 
= 4(1+ x") 

Substituting this in eq. 10,G = 1/(1 + x*) or the original 
transmission function is obtained. This, then, is the actual 
network response. 


and 


Rg?! Ll Ls 


i 


Fig. 6. Low-pass filter. 


NORDE 191 


It is interesting to note that the specific transmission 
function chosen here is the same as the actual transmission 
function for a single-section constant-K filter, although the 
derivation was entirely different from that of a constant-K 
circuit. The choice of this simple function was made to 
illustrate the use of the general concepts derived above. 

The normalized filter shown in Fig. 6 can be transformed 
to a practical filter by multiplying the normalized filter ele- 
ments with various constants. The exact method of doing 
this is explained and illustrated below. 


FLAT-TOP APPROXIMATING FUNCTIONS 


The ideal filter response for some applications is one which 
has a small insertion loss in the pass band and very rapidly 
increasing insertion loss outside the pass band. Actually, 
if no limit is placed on the number of filter elements (or the 
cost), the ideal flat-top response can be approached by means 
of the simple transmission function of the type used above.* 
In general, this transmission function can be represented by 


— 1 
1+ x" 


By making » large, the variation of power transmission 
is small for small values of x, and the power transmission 
decreases very rapidly with increasing value of x for large 
values of m. However, a large value of m means a corre- 
spondingly large number of circuit elements. Since cost 
and size of a filter frequently limit the design, a transmis- 
sion function more suitable for flat-top response is indeed 
desirable. Furthermore, it would be desirable to arrive at 
a transmission function which for a given number of filter 
elements has the smallest insertion loss in the pass band and 
the highest skirt selectivity possible outside the pass band. 

The approximating functions constructed from Tcheby- 
cheff polynomials satisfy these requirements. With the use 
of these transmission functions it is possible to design to a 
given filter response with a minimum number of elements. 
Stating this another way, for a given number of filter ele- 
ments and a given maximum allowable insertion loss within 
the pass band, the greatest possible skirt selectivity is ob- 
tained when a Tchebycheff approximating function is used. 

The Tchebycheff polynomial of interest is formally de- 
fined by 


(20) 


T,(x) = cos (n cos" x) (21) 
where 7,,(x) = the Tchebycheff polynomial. 
n= 1,2,3,°* * * (a positive integer). 


Although not readily apparent, 7,,(x) is a polynomial in 
powers of x, whose highest power is . This can be shown 


4W. H. Huggins, The Natural Behavior of Broadband Circuits, 
Rept. No. E5013, Air Force Cambridge Research Laboratories, May, 
1948. 
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Fig. 7. Sketch of [T7, (x) ]* and [7; (2) }*. 


by a simple substitution and trigonometric expansion, as 
follows: 
T3(x) = cos (3 cos" x) 

Let cos'x = @ 
Then T3(x) = cos (38) 

T3(x) = 4 cos* ©—3 cos ® 

= 4 cos* (cos? x) — 3 cos (cos? x) 
T3(x) = 4x° — 3x 


7 1 7 1 
»= +sin™ (44-) sinh (2) + joos~ (++) cosh (a) 
n 2 es 2 


In a similar fashion the polynomial can be expanded for 
other values of n. Some of these polynomials are listed be- 
low for reference. 


T;(x) =x (21a) 
T2(x) = 2x?-1 (21b) 
T3(x) = 4x° —3x (21c) 
T4(x) = 8x*- 827 + 1 (21d) 
Ts(x) = 16x° — 20x* + 5x (21e) 


On examination of these polynomials, it is seen that their 
absolute value is unity for x = 1. For x = 0, their absolute 
values are zero or unity for m odd or even, respectively. Be- 
cause some of these polynomials are not symmetrical with 
respect to the ordinate, they cannot be used directly in any 
transmission function. The polynomial squared, however, 
is an even function for any value of m. Tchebycheff poly- 
nomials therefore must always be used squared in transmis- 
sion functions. The squared polynomials have very distinc- 
tive properties, listed below in tabular form. The shapes of 


Value of x Properties 
1 [7,(#)]?=1 
0 [T, (x) ]}*? = 0 for n odd 
[7, (x) ]?=1 for n even 
z>>1 [T, (2) ]? a 20") a 
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the polynomials [7\(x)]* and [73(x)]* are sketched in 
Fig. 7. 


There are many types of useful transmission functions | 
possible with Tchebycheff polynomials. The first and most ] 


common type is defined by 
ah 1 
1+ [eT,(x) ]* 
where « is a real number such that 0< «<1. This trans- 
mission function has » maximum points and (m—1) mini- 
mum points inside the pass band (ie., —1< x <1) and 
decreases smoothly outside the pass band. The values of G 


(22) 


at the minimum points and at x = +1 are equal; i.e., the 


response goes through equal up-and-down variations up to 
the edge of the pass band. This type of transmission func- 
tion is sketched in Fig. 8. 

To use the transmission function (eq. 22) it is necessary 


that the denominator be factored. The factoring process is 


straightforward and is shown in the Appendix. 

The equation [«7,(p)]*-+ 1 = 0 has 2m roots which 
may be real or conjugate complex. The Ath root of the 
polynomial is given by eq. 23. 


(23) 


1 1 
a = — sinh" — 
n € 


k=0,1,2°°* (m-1) 
For example, the roots corresponding to m = 3 are: 


where 


8 sdk Ge 


1 
— -— sinh "i St 
pi = sinh (a) + j : 


po = — sinh (a) + a cosh (a) 


ps = sinh (a) 
ps = -sinh (a) 


} = 


ib, ae (e} - pee (a) 
2 2 k=2=(n-1) 


Pe= ~~ sinh (a) i cosh (a) 


The locus of the roots p, is a simple geometric figure when 
represented on the complex plane. 


Let sinh (a) =A 
cosh (a) = B 
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Fic. 8. Normalized Tchebycheff transmission function. 


Substituting in eq. 23: 


pi = Asin 6 + jB cos ¢ 

As the value of ¢ goes through all values from 0 to 2x 
radians, p, will have points on an ellipse, having a major 
axis 2B and a minor axis 2A long. This is illustrated by 
Fig. 9. 

The simple geometric property of the locus on which 
all the roots fall can be used to obtain the roots of 
1 + [e7,(p) ]* in the following simple manner. 

First, find the 2m roots of —-1 by solving the equation 
1 + p°"™=0. These roots fall on a circle and have angular 
separation of +/n radians, as was pointed out above. Second, 
to obtain the roots of 1 + [«7,(p)]* = 0, multiply the real 
components of the roots of —1 by sinh(a), and multiply the 
imaginary components by cosh(a). For example, the six 
roots of —1 were found to be: 


2 2 


Multiplying the real parts by sinh(@) and the imaginary 
parts by cosh(a), the roots of 1+ [e«73(p)]? = 0 are 
obtained as before. 

Having obtained the roots of the denominator of the 
transmission function G, the process of the filter design is 
identical to that with any gain function. As shown above, 
in order to obtain the transfer impedance function those n 
roots are selected whose real parts are negative. 

To illustrate the use of the Tchebycheff polynomial, the 
design of a low-pass filter will now be described. 

Suppose that a low-pass filter, having the following speci- 


fications, is desired: 
1. Terminating resistances are 1 ohm. 
2. Maximum insertion loss within the pass band is 1 db. 
3. Minimum insertion loss at x — 1.2 is 6 db. 
4. Minimum insertion loss at x = 2 is 20 db. 
5. Minimum insertion loss at x = 4 is 40 db. 
To design the filter, the transmission function 


1 


C= 
1+ [eT (x) x 
will be tried. The problem now is to find ¢ and the order 
of the Tchebycheff function, from the known information. 
It should be recalled that [7,,(x)]? is unity for x = 1. 
Letting x = 1 be the cutoff frequency, and noting that the 
response is 1 db down at cutoff, 


G (0) 
101 = 10log (1+ 2) =1 
"eG u) og (1 + &) 


Solving for ¢’, it is found that @ = 0.26; « = 0.51. There- 
fore, the transmission response is 


1 
C= 
1 + 0.26[T,(x) }* 
Further, at x = 1.2, the insertion loss should be at least 6 
db. Trying the function 73(x), 


G= . 
1 + 0.26 [73(x) ]? 


and the loss in decibels is 10 log 3.78 = 5.8 db. Checking 
as = 2, 


(24) 


1 


174 
Loss in decibels = 10 log 174 = 22.4 


Similarly, at x — 4, the loss is found to be 41.9 db. These 
results are summarized in the following table. 


Normalized radian Desired insertion Caleulated insertion 
frequency loss, db loss, db* 
0 0 
1 1 
6 5.8 
20 22.4 
40 41.9 


* From eq. 24, insertion loss is 10 log,, {i + 0.26 [7,() ]*}. The nu- 
merical values of T,(2) are obtained from eq. 21c. 
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Fic. 10. Normalized flat-top low-pass filter. 


Comparison of the desired with the calculated filter re- 
sponse indicates that, for practical purposes, the third-order 
Tchebycheff function is satisfactory. 

Accordingly the desired transmission function is: 

1 
C= 
1 + 0.26[T;(x)]}* 

The roots of the denominator can be found from the 

six complex roots of unity by solving the equation 


1+ p°=0 
The roots as before are located on the circumference of a 
circle of unit radius (refer to Fig. 5). From eq. 23, 


(24) 


1 1 1 
@ = —sink* ——_ — — sinh" 1.96 
3 0.51 3 


From a table of hyperbolic sines and cosines:® 


a = 0.475 
sinh (a) = 0.493 
cosh (a) = 1.115 
Multiplying the real parts of the roots above by sinh(a) 
and the imaginary parts by cosh(a), the desired roots be- 
come 


Ellipse half axes 


p1 = -0.247 + j (0.965) 
pi = -0.247 — j (0.965) 
pe = —0.493 


The factors of the transfer impedance are 


Multiplying out this expression and neglecting scale factors, 
the desired transfer impedance becomes 


Z, = p® + 0.987 p? + 1.236 p + 0.49 


This transfer impedance is identical in form to the transfer 
impedance of the T network used in the first example. 


2R 
+ 
Lc } 


From the identity Z, = Z;’ the values of the filter elements 
are found by equating the corresponding coefficients. 


2R R? 
—— = 0.987 = 1.236 = = 0.49 
L L °C 


2R 2 R 
Z/ = LC P+e—t+p(—+ =) 
L | Oe 


2 
= -- 
LC 


5 Decibel Tables and Tables of Hyperbolic Functions, Bell Telephone 
Laboratories Pamphlet. 
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(p +.0.493) {p—[-0.247 + j(0.965)]} {p— [-0.247 — j(0.965) ]} 


Letting R = 1 and using the first and last relations, 


L254 
C = 0.993 f 


The desired low-pass filter is therefore illustrated by Fig. 10. | 
The transfer impedance of the filter, including the scale j 
factor L7C, is 


Z;' = 4.08 (p* + 0.987 p? + 1.236 p + 0.49) 


at P = 0 and Z; = 2, which checks with the transmission 
function requirement that G = 1 at zero frequency. 

As a further check it would be desirable to calculate the | 
actual output power into the load as a function of frequency, 
with a constant-amplitude supply voltage. Using the ex- 
pression for Z,’ and substituting p — jx, the transfer im- 
pedance as a function of frequency becomes 


Z; = (2-—4.02x") + jx (5.04 — 4.08x") 


|Z, | = V 16.7x — 25.1x4 + 9.32x7 + 4 
The power flowing in the load s (R, = 1): 
a E 
|Z,|2  16.7x®— 25.1% + 9.32x7 +4 


The maximum power which can be delivered to the load is: 


\2 


Ti 


(z= 4%) 


Sie 1 
Pax  4.15x° —6.27x* + 2.33x2 + 1 


It is apparent that the above frequency response is identi- 
cal to the original Tchebycheff response selected, within a 
small arithmetical error; therefore, the actual response of 
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Fig. 11. Frequency response of constant-K and Tchebycheff low- 
pass filter prototypes. : 
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For purposes of comparison, the constant-K response is 
plotted on Fig. 11 along with the Tchebycheff response of 
the last illustrative example. It may be seen that, within the 
pass band (i.e., | x | <1) the Tchebycheff filter is flat within 
1 db and the attenuation increases rapidly outside the pass 
band. For larger values of frequency (| x |> 2) both filters 
attenuate at the same rate with increasing frequency; how- 
ever, the attenuation of the constant-K filter is about 6 db 
lower. The shapes of the frequency-response curves of these 
two types of filter remain the same with respect to each 
other, regardless of the number of elements in the filter, as 
long as the two filters compared have the same number of 
elements. Generally speaking, therefore, for a given number 
of filter elements the Tchebycheff network will have a flatter 
bandpass and lower skirt than the corresponding constant-K 
filter. 

It is not necessary to go through the computations to find 
the roots of <; every time a filter design is required, because 
it is possible to deduce design formulas for Tchebycheff re- 
sponses containing 7;(x), T2(x), T3(x), etc. For the re- 
sponse used in the last example, it was shown that the roots 
of the transfer impedance are 


y= — sinh (a) + iN cosh (a) 


— [3 
A= ~—sinh (a) AS cosh (a) 


p2 = -sinh (a) 


where a is as defined for eq. 23. 
neglecting scale factors, 


It was also shown that, 


Z: = (p- pr) (p— ps) (p— pz) 


or 


Substituting the numerical values of the roots, 


Fig. 12. Typical x network low-pass filter. 


circuit values for a single-section + network can be found. 
The design equations for the z network are shown below, 
and the corresponding circuit is shown by Fig. 12. 


= anh Gah (29) 
C 


3. + 2. = 2 sinh? (a) + ud (30) 
Cc 4 


LC 

The values of a and « have the same definitions as before. 
The x network is useful when the filter is to operate from a 
constant-current source. 

In a similar manner, the design equations for more com- 
plicated filters can be worked out, keeping in mind that the 
response of the more complex filter corresponds to Tcheby- 
cheff response of higher order than 3. The order of the 
Tchebycheff response corresponding to a given network is 
the same as the exponent of the highest power of p in the 
transfer impedance expression of the network; i.e., if p° is 
the highest power in Z,’, then the corresponding transmission 
function is 

1 


C= 
1+ &[T;(x)]? 


FILTER TRANSFORMATIONS 
Although the design of filters is conveniently done on the 


Z,= f° -P? (pi t+ Pit pe) + P [Pidi + Pe (Pr + pi) |— Pippo 


Z,= pf + p?[2 sinh (a) ] + p[2 sinh? (a) + 34] + sinh (a) [sinh? (a) + 34] 


Again from the identity Z,; = Z;' the filter elements can be 
related to the coefficients of p by the following design equa- 
tions, noting that R, = R,; = 1: 


(25) 


1 
— = sinh (a) 
A 


2 


ae age 2 sinh? (a) ret (26) 
ae 4 


1 1 
a = — sinh! — (27) 
3 


€ 
2 
a 


= Peak-to-peak value of G within the pass band (28) 
i+eé 
These are design equations for a single T-section normalized 
low-pass filter. By the use of “nodal” circuit concept the 


basis of a normalized prototype, the values of the filter ele- 
ments must be scaled eventually to the actual impedances 
and frequencies required by the initial filter specifications. 
Furthermore, the filter configuration which is required may 
be bandpass, not low pass like the prototype, and conse- 
quently the filter configuration may have to be changed to 
suit the actual problem. 


There are therefore usually three basic alterations neces- 
sary on the prototype, namely, change of impedance scale, 
change of frequency scale, and change of configuration. 
The first two changes, if properly made, do not alter the 
shape of the frequency response of the filter, and the changes 
may be effected in any order to suit the convenience of the 
designer. 
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HIGH- PASS 


C= 


Wo" L 


“Ee 


BAND-PASS 


®. = Oy — OL; Oy = VY oy o,, 


FREQUENCY 


Ly = 
@o°C 


t= 


Wo" | 3 


We RADIAN CUT-OFF 


~~ BAND-ELIMINATION 


OD. = O7 — OL, % = V Oy 7, 


Low-pass prototype Transformed filter 


Transformed response 


Transformation formulas 


Fig. 13. Filter transformations. 


The change of frequency scale requires that any reactance 
in the filter remain constant when the cutoff frequency is 
changed numerically. In other words, if a reactance is, 
say, 3 ohms at the normalized cutoff frequency, it must be 
3 ohms at the new cutoff frequency. 

Let Lo = any inductance in prototype. 

Co = any capacitance in prototype. 
Z =any inductance in final filter. 
C =any capacitance in final filter. 
x, normalized cutoff radian frequency (this is 
unity by definition). 
, = final radian cutoff frequency (#, = 2zf,). 
Ro= normalized terminating resistance (Ro = 1). 
R = final terminating resistance. 
To keep impedances invariant, therefore, 


XeLo = o,L 


and thea 


or, since x, = 1, 


(31) 


(32) 


To change the frequency scale, therefore, all inductances 
and capacitances in the prototype must be divided by the 


new radian cutoff frequency. Since resistance has the same 
value at any frequency, the generator and terminating re- 
sistances remain the same. 

The change of impedance scale requires that the quotient 
of any one reactance or resistance divided by the terminating 
(load) resistance be a constant. Therefore, 

x-Lo Sel. 1 1 
—— and = 
Ro R %CoRo x-CR 


or, since Ro = 1, 


(33) 
(34) 


The transformation of the low-pass filter into a high-pass, 
bandpass, or band-elimination type requires that the input 
and output terminations remain unchanged and that the 
response at corresponding frequencies be unaltered. The 
proof of the transformations is lengthy; consequently only 
the actual transformation formulas and corresponding circuit 
arrangements are shown above. It should be emphasized 
that the responses are unaltered at corresponding frequencies, 
which may have different numerical values. For example, 
the response of a low-pass filter at one-half its cutoff fre- 
quency will be the same as the response of a high-pass filter 
at twice its cutoff frequency. The various transformations 
are illustrated by Fig. 13, along with the formulas of trans- 
formation. The shaded portions of the bandpass curve cor- 
respond to actual physical performance; the unshaded por- 
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tions for negative frequencies illustrate the symmetry postu- 
lated by eq. 12. 

To illustrate the use of the various transformations, the 
normalized filter of the last example will be transformed 
into a bandpass filter having the following specifications: 


Center frequency = 5,000 cps 
Bandpass (1 db down) = 1,000 cps 
Input and output termination = 600 ohms 


First, the frequency and impedance transformations are 
made: 


©, = 247 X 1,000 = 6,280 
2.03 

L=—=3.24 xX 10* h 
oe + Frequency scale 


“a 0.993 transformation 


—— = 1.58 k 107 f 
’ 0 
L =3.24 x 10* * 600= 0.194h } 
1.58 &K 104 
Cc je Seon. 10° = 0.263 pf 
600 


Impedance scale 
transformation 


a 10° 
"(Qe X 5,000)? X 0.263 
Bo 10° 
(2% X 5,000)? 0.194 
fr = 4,530 cps 
fu = 5,530 cps 


The transformed filter and corresponding bandpass is 
illustrated by Fig. 14. 


L 


= 3.84 & 10*h Change to 


bandpass 
(refer to 
Fig. 13) 


= 5.25 & 10° uf 


CONCLUSIONS 


It was shown that filters designed by prescribing the 
locations of the network poles and zeros have certain well- 
defined advantages over filters designed on the classical 
image impedance basis. The design of filters by pole and 
zero locations, however, is somewhat more laborious than 
the design by classical methods and hence may well be use- 
ful only where the economy of materials and space permit 
the expenditure of more engineering effort. 
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APPENDIX 


Roots of 1 + [«7,,(x)]* = 0* 
Solving for T,, (x): 


T, (x) = + j— 
€ 
cos (ncos?x) = + 
ncos'x=a-+ jp 
cos a cosh 8 —j sina sinh 8 = + 
cos a cosh 8B = 0 


1 
sin a sinh B = +— 
€ 
TT 
From eq. 35, a war tes kn 


.. Sina = (—1)* 
From eq. 36, 


1 
sinh 8 = (-1)* ( 2—) 


* This process was obtained from Dr. H. J. Carlin of the Polytechnic 
Institute of Brooklyn. 
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8 = sinh" [-o-(=+)] nae 


Substituting for a and B: 


1 
# cos? x, = — ++ ke + jsinh (+—) 
2 € 


1 
sinh na = — 
€ 


7 1 7 1 
n= 0s (—+ 4 ) cosh (a) s-jsin™ (+ & ) sinh (a) 


n n 
If p, = jx, and roots with negative real components are selected, 


7 1 7 1 
p =-in= (+2) sinh (a) + joos~ ( — +h) cosh (a) 
n 


n 
where k= 0,1,2 °° * (m—1). 
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* 1 1 
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ser n € 

S ituting in eq. 37, 

% PC nh fps 
- costx, = — | —+k } + ja t 
im n 2 

a (37) and 

: 

ve - l ae 
ei Let x, = cos |— | —-++& } + ja 

a n \2 | 

om q 
it 4 
& 

a 

Pi 
7 a 
£5 

m | 
ae 

ey - 
= 

3 

es ; 
ae : 
es 

oe 
xa 
red 

25 
ms 

a 

wite 

: 

<4 

es 

Se 
net, 

a 

Be 
= | 
a 

, ; 
foe 
hee. 

2 
ak j 
a 1c 

es 
oa 
. | 
a. f 
ee ‘ 
ee { 
oe 
o 
oe 
ee | 
ps 
“2s | | 
er i 
a ; 
od _ 
Ar i 
oy 
ist 

i 
ye: 

Prt: 

‘ae P 


JOURNAL OF THE AUDIO ENGINEERING SOCIETY 


APRIL 1953, VOLUME 1, NUMBER 2 


The Vibrating String as an Audio-Frequency Circuit Element 


Joun A. McWapw 
Byron Jackson Company, Los Angeles, California 


A nonmagnetic wire under tension in a magnetic field possesses many of the properties of a 
high-Q resonant LC circuit. The vibrating wire element has excellent stability and extremely high 
Q. These features, plus the fact that the wire can be tuned over a wide range of frequencies by 
varying the tension, result in a device which can be adapted to the measurement of physical 
variables used as a frequency or time standard or used as a bandpass filter or frequency discrim- 


inator in communication and control circuits. 


IS WELL KNOWN that when a string is stretched and 

plucked it emits a musical tone. For three thousand 
years strings and wires have been used only in musical in- 
struments. Such instruments make use of the fact that the 
pitch, or frequency of vibration, can be varied by changing 
the length of the string or the tension on it. Nowadays we 
are interested in vibrations, or oscillations, in electrical cir- 
cuits; so some curiosity concerning the oscillations of a 
stretched wire is justifiable if the wire can be made to look 
like an electrical circuit element. 

By itself, a vibrating wire or string is a purely mechanical 
element, and its frequency of vibration is determined by 
mechanical parameters such as length, density, and tension. 
If the wire is made of a nonmagnetic conduction material, 
is placed in a magnetic field, and is excited by some means, 
a voltage will be induced in the wire. The frequency of the 
voltage will be the same as that of the wire vibration. Con- 
versely, if the wire in the magnetic field is excited by an 
ac current of the same frequency as the resonant frequency 
of the wire, it will vibrate readily, and another ac voltage 
will be induced in the wire. 

By the introduction of a magnetic field the vibrating wire 
becomes an electrical circuit element. Its operation is 
analogous to a high-Q LC tank, possessing the stability char- 
acteristics of an electrically driven tuning fork or quartz 
crystal. Unlike the tuning fork or the crystal, the vibrating 
wire can be tuned over a wide range of frequencies, for 
example, as much as 3 octaves. 

If the magnetic field around the wire is distributed prop- 
erly along the length of the wire, the wave form of the signal 
generated by the wire will be essentially a pure sine wave 
having less than 0.1% harmonic distortion. By providing 
multiple magnetic fields arranged so that adjacent fields are 
oppositely oriented along the length of the wire, the wire 
can be made to vibrate at harmonics of its resonant fre- 
quency. Two sets of magnets will produce second-mode 
operation, and three sets of magnets will cause the wire to 
operate at its third harmonic. 

If a voltage is applied to a vibrating wire element, a cur- 
rent will flow. This current depends only on the resistance 
of the wire and the applied voltage, conforming to Ohm’s 


law, and is always in phase with the applied voltage. As 
the current flows through the wire, the wire deflects, owing 
to the presence of the magnetic field. The phase angle be- 
tween the deflection of the wire and the driving current 
becomes exactly 90° at resonance, varying between 0° and 
180° below and above resonance. The voltage induced in 
the wire depends on the rate at which the conductor cuts the 
magnetic flux, which in turn depends on the velocity of the 
wire. The velocity of the wire is greatest when its displace- 
ment is zero; hence the induced voltage is greatest at that 
time. The phase angle between the displacement of the wire 
and the induced voltage is always 90°, with the voltage lead- 
ing the displacement. Thus, at resonance, the induced 
voltage is in phase with the applied voltage. 

The ratio between the voltage across a vibrating wire 
element and the current flowing through the wire is the 
impedance of the element. At resonance this ratio is a 
maximum, since the applied and induced voltages are in 
phase. The ratio of impedance at resonance to the dc 
resistance of the wire is called the impedance ratio. Typical 
elements now being manufactured have impedance ratios in 
the order of 3 or 4 to 1, but ratios up to several hundred to 
1 are possible. 

At any particular frequency the impedance of the vibrat- 
ing wire at resonance is a function of the magnetic field 
strength and viscous damping force according to the relation 


Z = kB*/D 


The impedance of a particular element can be adjusted most 
conveniently by controlling D, which is a measure of the air 
density around the vibrating wire. This is accomplished by 
varying the degree of vacuum in which the wire operates. 

The impedance of the wire is a measure of the power 
required to drive the wire. For wires having impedance 
ratios of only 3 to 1, the power required to drive the wire at 
resonance is of the order of 107 w. This is typical of an 
element consisting of a 0.4 thousandth of an inch tungsten 
wire in a magnetic field of 6,000 gauss in a container which 
is evacuated to 28 in. of mercury. 

The bandwidth of the vibrating wire is defined as the 
difference between the frequencies at which the phase angle 
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VIBRATING WIRE 
SUPPORT ASSEMBLY 
o 


[ 


} INSULATOR 
LOWER END ROD b UPPER END ROD 


Fie. 1. Basie Vibrotron element. 


between the driving current and the induced voltage is 45°. 
The bandwidth is essentially constant at all resonant fre- 
quencies for any given set of parameters and method of 
construction. Thus, the Q of the wire increases with the 
resonant frequency. A Q of about 2,500 at 25 kc is typical 
of vibrating wire elements now being manufactured. The 
Q of a similar element with a resonant frequency of 12.5 
ke would be 1,250. In elements now being manufactured, 
bandwidth, and therefore Q, depend essentially on the stiff- 
ness of the wire. 

Vibrating wire instruments are being manufactured and 
sold under the name Vibrotron. The production instrument 
is about % in. in diameter and 2% in. in length. The fre- 
quency of the Vibrotron can be adjusted at the factory 
between about 1,000 cps and 30 kc. By means of suitable 
fittings and design variations, the wire frequency can be 
made to vary in proportion to some physical variable, such 
as force. This becomes an FM measuring system, in which 
all the information regarding the variable being studied is 
contained in the frequency of the signal rather than in its 
envelope. The frequency in cycles per second can be counted 
with great accuracy, providing accurate measurement of 
physical variables of the order of 1 part in 5,000 or better. 

The standard Vibrotron element, Fig. 1, is constructed of 
pure metals as far as possible. Tungsten and molybdenum 
were’selected because of their high tensile strength and low 
thermal coefficient of expansion. The vibrating wire is a 
pure tungsten filament 0.4 thousandth of an inch in diameter 
and from % in. to 1 in. in length. A magnetic field of 6,000 
to 8,000 gauss is supplied by two small Alnico V magnets. 
The wire is supported by tungsten end rods of about 40 
thousandths of an inch in diameter, the wire being silver- 
soldered to the end rods by means of a pinpoint flame under 
a microscope. The end rods and magnets are fastened to 
the support structure which is milled from a length of molyb- 
denum tubing. After the wire is brought to the proper 
frequency, or tuned, this assembly is placed in a protective 
case, and the interior of the case is evacuated so that the 
wire is operating in a vacuum. Electrical leads are brought 
out through a seal at one end. 

The frequency of the wire may be varied by changing the 
tension on the wire. A total deflection of the order of a few 
thousandths of an inch will tune the wire over its range. 
The If limit is set by the minimum tension at which the wire 
will vibrate efficiently, and the upper limit is set by the 


elastic limit of the wire. For a tungsten wire these limits J 
are about 25,000 psi and 225,000 psi, a range of 9 to 1. J 
Since frequency varies as the square root of tension, this 
represents a frequency ratio of 3 to 1. 

The electronic amplifier used with a vibrating wire element 
serves three purposes, as illustrated in Fig. 2. Specifically: 

1. It supplies energy to the wire to initiate and sustain 
vibration (feedback). 

2. It amplifies the signal generated by the wire to a useful | 
value in the order of volts. 

3. It limits the operating level of the system in order that 
the vibrations cannot build up to such a level as to cause 
damage to the wire or error in wire frequency (AGC). 

The circuit can take the form of a two-stage resistance- | 
coupled amplifier with positive feedback to the input, if the 
primary requirement is simplicity. A Mazda lamp can be 
used as a ballast element to limit the operating level of the 
system. The low impedance and low power requirements 
of the Vibrotron are particularly suitable for use with | 
transistors. 

The vibrating wire and amplifier comprise an oscillating 
system, the frequency of oscillation being determined by 
the resonant frequency of the vibrating wire. As a static 
system, this is a good frequency or time standard with the 
same order of long-term stability as that of a quartz crystal 
in the af range, nominally 1 part in 10° in a temperature- 
controlled environment. Power-supply regulation is not 
required for good frequency stability if the amplifier used 
with the vibrating wire is designed to maintain a constant 
voltage across the wire. Too great an increase in wire voltage 
causes the wire to deflect too far, increasing the average 
tension and resulting in a change of frequency. 

Because of its high Q and good stability characteristics, a 
vibrating wire element makes a good narrow-band frequency 
discriminator or bandpass filter. For a given combination 
of parameters such as wire material and dimensions, and a 
method of anchoring the wire ends, a vibrating wire element 
will have essentially the same pass band over quite a range 
of frequencies. This, together with the fact that the wire 
can be tuned over a wide frequency range, makes it useful 
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Fig. 2. Basie Vibrotron oscillator. 
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as a variable-frequency narrow pass band filter with constant 
bandwidth at all frequencies in its range. 

The bandwidth of a vibrating wire is determined primarily 
by mechanical parameters, and specifically only those affect- 
ing the length of the wire or the tension on it. The basic 
formula for the frequency of vibration of a wire under 


tension is 
T ) ” 
ol? 


Because of the fact that the wire has some stiffness, the ends 
of the wire are stiff for some distance from the points at 
which the ends are anchored as in Fig. 3. The frequency is 
determined, not by the measured length of the wire, /, but 
by an effective length,/—2 Al. For a constant amplitude 
of vibration, A/ can be varied by changing the ratio of the 
length to the diameter of the wire, or the material from 
which the wire is made. Vibration of the wire can be forced 
over a range of frequencies, the limits of which are deter- 
mined by / and / —4 Al. If the wire is long compared to 
its diameter, this bandwidth is substantially independent of 
the actual natural frequency of the wire. Therefore, if the 
frequency is varied by changing the tension, the bandwidth 
of the wire, considering it as a bandpass filter, will be essen- 
tially constant. 

For a tungsten wire 0.4 thousandth of an inch in diameter, 
the bandwidth is approximately 10 cycles. Used as a 
narrow-band frequency discriminator, such an element will 
change the relative amplitude of an ac signal voltage 6% 


for a 1-cps frequency change when operated along the reso- 
nance curve between the peak and the half power point. 

The stability of the vibrating wire element as an elec- 
trically resonant device and the ease with which it can be 
accurately tuned over a wide range of frequencies make it 
a useful tool in the fields of measurement, control, and com- 
munications. It is not a laboratory curiosity but a depend- 
able scientific instrument, being used in the atomic energy, 
guided missile, and other research fields. The vibrating wire 
element is a good example of an ancient principle being 
adapted to the solution of modern problems. 


LIST OF SYMBOLS 
__ Resonant frequency 


v= 
Bandwidth 
Z = Impedance 
B= Magnetic field strength 
D= Viscous damping factor 
k = A constant 
F = Frequency 
T = Tension 
o = Lineal density 
l = Length 
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The Application of Printed Circuits and Miniaturized Assemblies to Audio Amplifiers 


and Equipment 


Jesse KANAREK 
Hughes Aircraft, Research and Development Laboratories, Culver City, California 


ROUGHOUT ITS history, the electrical industry has 

been shackled to hand assembly of equipment with 
conventional wiring and a tedious succession of hand- 
soldered connections. Today, the need for smaller, more 
rugged and reliable electronic equipment of increasing com- 
plexity, coupled with rising labor costs, is creating a trend 
toward the eventual automatic assembly of such equipment. 
Fully automatic assembly may be a long way off, or it may 
be closer than imagined, but printed wiring and dip-soldering 
are techniques which are ready for use today. 

The design and production of printed wiring, by etching 
copper-coated plastics, is simple and versatile. It is a 
method of eliminating hand wiring, reducing human errors, 
and avoiding many of the skills presently required in elec- 
tronic equipment manufacturing. It is immediately adapt- 
able to all types of audio equipment design, and it can be 
used with ordinary standard components as well as minia- 
turized components, as will be shown in this paper. 

The printed circuit is essentially a World War II develop- 
ment. Thousands of proximity fuses utilizing printed circuit 
techniques were built during the war. Since the war, 
many government and industrial facilities have been en- 
gaged in furthering the development and usage of printed 
circuits. The armed services are particularly interested in 
printed circuits to facilitate maintenance and construction 
of the great quantity of equipment that would be required 
in an emergency. Contractors are developing new com- 
ponents and entire equipments utilizing printed circuit 
techniques. From these equipments and components shall 
come new design techniques and answers to the many prob- 
lems that may confront us in applying this information to 
mass production of printed circuit equipment. Already we 
see the application of some of these techniques to electronic 
equipment designed for use in the home. Plated wiring on a 
plastic chassis is being utilized by Motorola in a home radio 
which they are now marketing. This is shown in Fig. 1. 
Several television manufacturers are preparing printed cir- 
cuit chassis for distribution in 1953. These units are pro- 
duced by fully or semiautomatic techniques requiring large 
expenditures for new plant equipment. The added cost is 
offset by the fewer assemblers required to manufacture each 
unit. Automatic assembly techniques are also being studied 
by General Electric, the Stanford Research Institute, and 
others in this country for end use in civilian and military 
electronic equipment, respectively. 


Fig. 1A. Front view of Motorola ‘‘ Placir’’ chassis showing de- 
tails of tuning, dial, volume control, tube sockets, ete. 


Fic. 1B. Rear view of Motorola chassis showing mounted tubes 
and components. 


In England, Sargrove Electronics, Ltd., produced a home| 


receiver using only two double-function tubes by completely 
automatic means as far back as 1947. 
2, 3,4, and 5. Their technique was to spray a conductive 
material all over a plastic base with grooves and depressions 
supplied for coils, capacitors, and interwiring. The surface 
was then ground off, leaving a plastic base with metal- 
filled grooves and depressions. The Motorola receiver and 
the Franklin Airloop antenna utilize a different a 
for producing their printed wiring. 


This is shown in Figs.# 
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Fig. 2. Stages of manufacture of the Sargrove printed circuit 
chassis. 


Fie. 3. Front view of front printed circuit chassis of the Sar- 
grove receiver, completely assembled. 


Printed circuits have already made some headway in the 
audio field. Hearing-aid manufacturers have been practic- 
ing miniaturization techniques for years. Their goal has 
been to achieve high quality in a small volume. They were 
instrumental in developing progressively smaller components 
for use in their own products, which soon found their way 


into other electronic equipments. They pioneered in sub- 
miniature tube and transducer development as well as the 
introduction of new packaging techniques. They adapted 
printed circuits to these miniaturized assemblies, utilizing 
several techniques for applying the conductive material to 
the plastic base. Printed circuits have helped hearing-aid 
manufacturers reduce interwiring costs and make their equip- 
ment more reliable and shock-resistant. Figure 6 shows the 
plastic chassis which was used by one manufacturer to mount 
and wire a complete three-tube hearing-aid amplifier unit. 
Cutouts are provided for several components to facilitate 
mechanical and electrical connection to other circuit com- 
ponents. If the components are properly mounted, there 


Fig. 4. Rear views of front and back printed circuit chassis of 
the Sargrove receiver, completely assembled. 


Fic. 5. Front view of rear printed cireuit chassis of the Sargrove 
receiver, completely assembled. 
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Fig. 6. Left, British silver-fired-on-glass chassis showing printed 
resistors and plug-in properties at bottom. Right, conductive mate- 
rial on a plastic base used as a hearing-aid chassis. 


can be no errors due to interwiring. Each unit is wired 
alike, both physically and electrically, and feedback is pre- 
vented by using short leads to connect components. 

Although the audio equipment described above is designed 
with miniaturization in mind, printed circuits will find wide 
application in equipment where this is not of prime import- 
ance. Printed circuits are just as suitable for use with 
standard components and normal wire spacing, for it still 
offers economies in point-to-point wiring, component han- 
dling, and subsequent lowering in rejection rates. 

The printing of electronic circuits can be accomplished 
by several techniques. One method is the spraying of a 
conductive coating on a grooved base buard or application 
by means of a stencil. A second method is the plating of 
metal on an insulating base material by selective methods 
and precoatings. Stamping of conductors from a solid sheet 
is a third method that has been used extensively. The fourth 
and most promising method is the use of a metal-clad in- 
sulating base material in the photoetching process. This 
method is closely related to photoengraving. The simplicity 
and economy in this process is made even more desirable 
by the fact that it requires lower capital investment than 
the other processes. 

To describe the etched circuit process, a brief outline will 
be given of the steps that are performed between the finished 
breadboard design and final inspection. When the design of 
the equipment electronically has been completed and the 
unit is being readied for packaging and production, the 
printed circuit designer is consulted. He examines the layout 
and suggests changes that are in keeping with good printed 
circuit design practice. He recommends changes in the com- 
ponents that have been used in the experimental model. He 
suggests improvements that will make the equipment more 
reliable and more adaptable to rapid assembly. The only 
important limitation on the design at this point is the use 
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of two dimensions instead of three; because of this limitation, 
the particular size and shape of a component may determine} 
its suitability for use with etched circuits. A means for 
utilizing the third dimension will be discussed later. 


Having completed this phase of the design, the engineer 
is ready to set up the physical layout of the components on | 
the printed board. An extremely important consideration 
in printed circuit design also is the prevention of feedback 
or cross-modulation encountered in hand-wired units. Al- 
lowances must be made for bending of component leads and 
interconnection of printed wiring, some of which may be 
placed under the components to save space and materials. 
Where the components join the terminations of the printed 
lines, the line widths are increased to accommodate the] 
component leads that come through the base board and are] 
soldered at these points. This is usually accomplished by 
drawing a circle having a diameter twice the width of the 
printed circuit line. This assures that sufficient copper will 
be present to make a good electrical connection to the com- 
ponent lead. When the layout of all the components, tube 
sockets, plugs, and hardware is completed, a drawing of the 
printed circuit is made in India ink on illustration board. 
Since printed circuits may be made on two sides of the 
insulating board and some lines will join by interconnection 
on the board, care must be taken to insure that these inter- 
connecting points are exactly aligned on the finished board. 
Upon completion, the drawing is checked and reduced to 
exact scale by photography to full size. 

In the photoetching process of manufacturing printed 
circuits, the photographic negative is used. In this process, 
the copper- or metal-clad insulating board is coated with a 
light-sensitive enamel. Thus coated, it is like an ordinary 
photographic printing paper. After exposure and etching 
of the unexposed surface, the printed circuit will appear 
as a metal foil laminated to the nonconductive material. 

For the silkscreen method of production, the positive 
transparency instead of the negative is used to cut the 
silkscreen. This method is a familiar one in the textile 
industry, where the process is used extensively. For the 
silkscreen technique, the board is placed tightly against 
the screen and an acid-resistant paint is squeegeed over the 
pattern evenly. No light box or developer is required. 
The board is ready for the etching bath immediately. The 
silkscreen process, as compared to photoetching, has the 
advantage of speed and economy and lends itself more 
readily to mass production of a single type of circuit. On 
the other hand, finer detail can be achieved with photoetch- 
ing. Some detail is lost with silkscreen because of the 
limitation of the size of the mesh of the screen. In most | 
cases the loss in detail is of little consequence. | 

The boards used in this process are composed of a con- | 
ductive material bonded to an insulating base. The con- | 
ductive material is generally copper, although laminates are | 
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available with metal foils of silver, brass, or aluminum. 
Insulating bases are supplied in several types of phenolics. 
Teflon-impregnated fiberglas, melamine- and _ silicone-im- 
pregnated fiberglas, polystyrene, epoxy base materials, and 
several others have been used for special purposes because 
of their particular characteristics. The circuits may be 
formed in many physical shapes by using flexible base ma- 
terials such as silastic-impregnated fiberglas, cloth, and 
many others. 

The etched boards are then cleaned, inspected, and sent 
to the mechanical processing department for drilling and eye- 
letting as required. In the past, components were soldered 
directly to the etched lines with no other mechanical sup- 
port. Upon replacement of components in the normal 
course of maintenance the metal foil occasionally would lift 
off the board as a result of loosening of the bond by heat. 
To prevent this, eyelets are now used in the component 
holes. 

After the drilling, eyeletting, and processing, the etched 
board is ready for assembly. At the present time, com- 
ponents are inserted by hand into the eyeletted holes and 
the leads are bent so that the components will not fall out. 
Several machines are being developed to prebend the leads 
to a predetermined eyelet spacing, cut the leads at a pre- 
determined length, and slightly flatten the ends. When the 


leads are inserted in the board either by hand or with a 
special machine, they “snap” into place and cannot fall 


out. The Stanford Research Institute and others are de- 
veloping machines which will completely assemble all the 
components automatically. 

The assembled unit is inspected for correct placement 
of components and is then fluxed. Fluxing can be accom- 
plished by either hand-brushing or dip-coating techniques. 
One group is experimenting with combining flux with the 
ink used in the silkscreen process so that fluxing and print- 
ing of the circuit are accomplished simultaneously. After 
fluxing, the board is ready for dip-soldering. Areas where 
solder is not desired can be masked off prior to dip-solder- 
ing. The length of time that the board is left in the bath 
must never exceed 10 seconds; otherwise the heat of the 
molten solder will loosen the bond between the foil and the 
insulating base. On the other hand, the board must remain 
in the solder bath long enough to heat the connection ade- 
quately; otherwise cold solder joints will result. 

The assembled board is now inspected for good solder 
connections. Any globs of solder or incomplete connections 
can be dressed with a hand iron. After the operator has 
developed good dipping technique, or if a dipping machine 
is used, when the cycle is properly adjusted this step should 
not be necessary. The board is then sent to the next de- 
partment for mechanical mounting in its case. The vacuum 
tubes and plug-in units, if any, are then inserted in their 
sockets, and the unit is tested. 
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Although the description of this process has been brief, 
the major steps have been covered. The simplicity of using 
the miniaturization techniques has been stressed. There are 
many precautions to be observed when this process is used, 
but with development of good techniques and familiarity 
with the equipment available, many economies can be effec- 
ted in production and labor costs. In regard to the latter, 
further mention should be made of automatic component 
assembly techniques. Production engineers working indi- 
vidually at several companies are thinking in terms of auto- 
matic electronic factories for the future. Although most of 
the assemblies they are working on are classified, informa- 
tion regarding some of these designs for automation is 
available to the public. Starting with a copper-clad insulat- 
ing board, machines will print circuits, etch, drill holes, 
mount components, flux, solder, and completely assemble the 
finished unit in a package, all by completely automatic 
methods. These automatic machines are not yet ready for 
mass production, but the ideas behind them are mechanically 
sound and have been used in other industries for years. We 
may therefore look to a slow evolution in the electronic 
industry in the near future—first to semiautomatic, then to 
fully automatic machinery. 

The inflexibility of such machines will not make them 
suitable for short manufacturing runs, however. Such 
standard units as amplifiers, small home radios, power sup- 
plies, and similar items which may undergo changes in com- 
ponent values but not in wiring layout design will be suit- 
able for automatic factory techniques. The use of trans- 
formerless amplifiers, ferrite core materials, and other means 
of circuit simplification will serve to reduce the cost, size, 
and weight of the quality and quantity of equipment without 
any serious reduction in fidelity, gain, or power output. 

Although the components used today by etched circuit 
designers are the same as those used in ordinary hand-wired 
units, several progressive manufacturers have been develop- 
ing new components designed specifically for etched circuits. 
As mentioned earlier, tube sockets that plug directly into 
the etched circuit board are now available, as are etched 
circuit plugs for making connections to external wiring 
or to other etched circuits. By using several of these plugs 
suitably connected to an etched board which serves as a 
wire harness or bus chassis, several boards may be stacked 
close together. The etched circuit tube sockets now avail- 
able do not permit this stacking closer than the height of 
the tubes used, but developments are under way to produce 
a right-angle tube socket which would lay the tube down 
almost on the chassis with only a minimum of space left 
for a barrier to the radiated heat. This would permit close 
stacking of boards and make plug-in amplifier units of given 
gain, bandwidth, and particular frequency-response char- 
acteristics interchangeable. The reduction in weight and 
volume may be considerable. Circuit changes of a major 
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nature could be made without rewiring the chassis, and pro- 
duction delays would be reduced or eliminated. Certainly 
many audio equipment manufacturers and experimenters 
would buy subunits rather than build them themselves. A 
wide range of commercial possibilities is being opened up 
by these new components, subassemblies, and design tech- 
niques. 

To illustrate many of the printed circuit materials and 
techniques currently in use, a conventional two-stage ampli- 


© © 


OUTPUT CIRCUIT 
WITHOUT PLUG-IN UNIT 


OUTPUT CIRCUIT 
WITH PLUG-IN UNIT 


Fig. 7. Output circuit design with and without plug-in unit. 


2- STAGE AMPLIFIER CIRCUIT 


PHASE INVERTING 
PRE-AMPLIFIER 


PLUG- IN AMPLIFIER UNIT 


Fig. 8. Complete circuit of printed cireuit amplifier with plug-in 
unit. 


Fig. 9A. Methods of construction of plug-in units. 
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Fie. 9B. Stages of construction of printed circuit chassis. 


Fie. 10B. Completed unit, assembled, without cover. 


fier has been designed and constructed. One half of a 12AU7 
is used as an amplifier and feeds the phase inverter stage 
which makes up the other half of the tube. The output 
feeds a plug-in unit which is used to increase the gain and 
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improve the frequency response. Removal of the plug-in 
unit does not break the circuit. This is accomplished by 
the use of three resistors in each channel. With the plug-in 
unit inserted, the three resistors function as a feedback 
loop. With the plug-in unit removed, they become a low- 
loss T pad. This assures some output for the amplifier even 
if a failure were to occur in the plug-in unit. The design 
for this circuit is shown in Fig. 7. The entire circuit con- 
sisting of the 12AU7 and the 6111 amplifier in the plug-in 
unit is shown in Fig. 8. 

The amplifier is designed for semiautomatic production 
with a probable reduction in labor costs and rejection rate 
due to interwiring errors as compared to a similar hand- 
wired unit. All the components—tube sockets, etched circuit 
plug, and output connector—are dip-soldered in a single 
operation. Both standard-size and special miniature com- 
ponents have been used to illustrate the versatility of the 
printed circuit progress. In Figs. 9 and 10 the various steps 
in manufacture of this unit are illustrated from the time of 
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removal from the etching bath to the completion of the 
product. Data and performance curves on the unit are 
available from the author but were not considered as im- 
portant as the manufacturing techniques that are illustrated 
by the unit itself. 

In summary it may be said that printed circuits will find 
applications in such diverse audio equipments as intercom- 
munication units, standard plug-in amplifiers, filters, cross- 
over networks, line pads, and many other subassemblies. 
Audio equipment manufacturers and designers should im- 
mediately recognize the fact that printed circuits and 
miniaturized assemblies are suitable for their products and 
can effect many economies in manufacture. Printed circuits 


are not only a development of the future; they are available 
right now. 

The writer wishes to thank his associates of the Minia- 
turization Group at the Hughes Aircraft Company for the 
numerous valuable suggestions and corrections they made 
in connection with this paper. 
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IHE D-42 MICROPHONE was developed originally for 
use with the acoustic firing error indicator.’ Briefly, 
this equipment consists of two microphones which are 
mounted at opposite ends of a plastic sphere in a towed 
sleeve target and which respond to ballistic shock waves 
and modulate tiny FM transmitters. Two receivers near 
the gunner actuate indicators that show this distance and 
inform the gunner whether he is leading or lagging a target. 
When the speed of a projectile relative to the air mass 
is appreciably greater than the speed of sound, the pro- 
jectile sends out from the trajectory acoustic waves known 
as ballistic shock waves. The intensity of the ballistic 
shock waves existing at a given point in the air is a char- 
acteristically diminishing function of the distance from the 
projectile. In the firing error indicator, two microphone- 
and-transmitter units are mounted in the airborne target 
and linked by radio with a radio receiver near the gun. 
The target airborne units send quantitative signals to this 
receiver that are indicative of the intensity of the shock 
waves from the projectiles as they pass in the vicinity of 
the target. This idea was carried through development and 
production stages at the California Institute of Technology 
during World War II under NDRC contract No. OEMsr600. 
Since these airborne microphone and transmitter units 
were expendable, the D-42 microphones for this use pre- 
sented a major problem in the research and development 
of the contract. The D-42 condenser microphone is shown 
in Fig. 1. A high degree of reproducibility as to acoustic 
characteristics coupled with relative cheapness of manu- 
facture and ruggedness of mechanical construction were 
required. The aim of this development was to obtain fre- 
quency flat response from zero frequency up to as high 
an audio frequency as was practicable, with the idea of 
utilizing the diaphragms as far as possible in their elastic 
regime of response. 

The ultimate approach is a microphone whose diaphragm 
deflection will follow faithfully, as a function of the time, 
the instantaneous value of the pressure associated with the 
shock waves, especially as regards the sudden changes in 
the discontinuities of the N-wave.* The characteristic N- 
wave contour of a shock wave is shown in Fig. 2. This is 


1 Marcus C. Eliason and W. G. Hornbostel, Acoustic Firing Error 
Indicator, Electronics, October, 1952, p. 98. 
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Fig. 1. The D-24 microphone, showing the rugged construction 
and also its dimensional scale. 


H 
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Fig. 2. A characteristic N-wave contour of a shock wave. This 
is a simplification of actual ballistic shock waves. 


a composite simplification of actual ballistic shock waves 
as reproduced by a hf quartz-plate microphone and photo- 
graphed from the face of a high-speed single-sweep oscillo- 
scope. This indicates the very abrupt rise and fall time 


2 J. W. M. DuMond, E. R. Cohen, W. K. H. Panofsky, and E. 
Deeds, A Determination of the Wave Forms and Laws of Propagation 
and Dissipation of Ballistic Shock Waves, J. Acoust. Soc. Amer., 
July, 1946, p. 97. 
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of the beginning, H, and the end, 7, of such a wave form. 
The duration of the transfer slope between H and T is 
determined by the length of the projectile and the miss 
distance, or the length of time the shock wave has been 
propagated. Because of the extreme abruptness of these 
transitions, this necessitates flat response up to a frequency 
of a megacycle per second, or more, a requirement too diffi- 
cult to meet. Therefore, the objective is to push the natural 
frequency of the diaphragm to the highest practical figure 
consistent with good reproducibility and to keep the re- 
sponse curve as flat as possible up to highest possible fre- 
quency value by suitable damping. 

From these considerations it is clear that, if reliable 
shock-wave intensity measurements are to be made, a 
microphone with the following exceptional characteristics is 
required: (1) It must have very good hf response. (2) It 
must not have any serious resonance responses within the 
band of audio frequencies to be measured. (3) The fre- 
quency vs sensitivity characteristics must be as nearly uni- 
form as possible. (4) Since very high momentary sound 
pressures of about 30,000 dynes/cm? and higher are normal- 
ly encountered, and levels below 2 dynes/cm* need to be 
measured accurately, the microphone must have a high 
degree of pressure response linearity. (5) Since the micro- 
phone is to be mounted in flag targets at speeds up to 300 
mph under extreme variations of moisture and temperature 
conditions, it must be very rugged, mechanically stable, and 
insensitive to temperature variations. (6) The unit must be 
considered expendable, and therefore the design and pro- 
duction techniques must be such that it can be readily 
adapted to mass production with uniformity and high 
quality. 

Since the most feasible telemetering system under the 
field conditions to be met involved the use of a frequency- 
modulated transmitter, this immediately suggested the pos- 
sibility of using a capacity-type microphone as part of a 
tuned circuit of a simple master-oscillator power-amplifier 
type of transmitter. Such a unit was designed, constructed, 
and tested with gratifying results, indicating that further 
design and experiment was warranted. On the basis of the 
results obtained, an extensive program of research, develop- 
ment, and preliminary production for field testing was insti- 
tuted. Elaboration of these factors involved in the design 
of the D-42 capacity microphone will be covered here. An 
exploded view of the microphone is shown in Fig. 3. 

Two factors made a rather extensive program of experi- 
mental production unavoidable. These were (1) the ex- 
pendable nature of the transmitter units and (2) the fact 
that the problem of obtaining a high degree of unit-to-unit 
reproducibility as to sensitivity had to be solved. The 
exploded view shows the five parts: (1) the clamping ring; 
(2) the beryllium-copper diaphragm 1.6 mils thick; (3) 
the frame; (4) the stretching button, consisting of a 


Fig. 3. An exploded view of the D-24 microphone: (1) clamping 
ring; (2) beryllium-copper diaphragm; (3) frame; (4) stretching 
button; and (5) lock nut for the stretching ring. 


threaded brass ring with insulated back electrode mounted 
on a steatite insert; and (5) the lock nut for the latter. 
The diaphragm, which is under a tension approaching its 
elastic limit, is securely held by sixteen screws through the 
clamping ring into the frame. It is assembled under pre- 


stress by means of a special jig, and the final tension is ad- 
justed by screwing the button up against it from the back 
to the proper degree until its natural frequency is 12,000 


cps. The diaphragm clears the back electrode by only 
0.000985 in. This gap is determined by special machining 
and testing methods when the electrode button is made. 
The small clearance produces very high diaphragm damp- 
ing, so that there is no sustention of a natural diaphragm 
frequency, and this frequency can therefore be determined 
only by special methods with an “electrostatic tester.” The 
gap is such that the microphone frequency-response curve is 
very flat over the range from zero to 12,000 cps. These 
characteristics and their stability against temperature varia- 
tions are very essential to the success of the D-42 micro- 
phone. Matching of thermal expansion coefficients has been 
carefully attended to. Artificial aging of the microphones 
by temperature cycling is an important step in the manu- 
facture. 

The active diameter of the diaphragm, defined by the 
inside diameter of the threaded stretching ring, is 0.75 in.; 
this conveys the idea of the dimensional scale of the micro- 
phone as shown in Fig. 3. It is an essential feature that 
the clamping ring be very shallow so that in the assembled 
unit there will be no marked acoustic shading or interfer- 
ence effects produced by this ring. The space behind the 
microphone diaphragm must have a vent to the outside 
air to equalize changes in pressure accompanying changes 
in altitude. The time constant of this vent is extremely 
important. For the high frequencies involved, this vent 


ee 209 e. 

t: 

| Q 0 " 

MINN? & 
ti — aoe a q 

a ee 

© : O 

—— 


210 


must present practically infinite impedance. Venting of 
a cavity is accomplished by making a light radial V-groove 
across the stretching ring profile before the final machining 
cut is made. This permits adequate venting and still pre- 
vents dirt or moisture from penetrating directly. With care, 
no damage is done to the diaphragm because of the narrow 
gap of the groove. 

The insulating button is a high-grade steatite lapped 
to have flat faces and impregnated with silicone to avoid 
moisture absorption. The steatite has excellent hf char- 
acteristics, making it possible to use the microphone in the 
tuned circuit of the oscillator with excellent results. 

Production methods had to be developed to tune all 
diaphragms to the same frequency. Since the critical damp- 
ing made a resonance amplitude-determination procedure 
impossible, a special method of electrostatically exciting the 
diaphragm during assembly was worked out. By measuring 
the resulting response amplitude and phase relation to driv- 
ing force, it became possible to determine resonance and 
frequency response simultaneously during production. 

The critical air-gap spacing made it necessary to pre- 
stress all diaphragms to a predetermined tension before 
mounting on the frame. The final stressing is done with 
electrode and stretching ring assembly after the diaphragm 
is fastened in place, thus avoiding the various degrees of 
bending which occurred with unstretched diaphragms. Pre- 


stressing causes the active portion of the diaphragms to 


become essentially flat and parallel with the electrode face 
at all points. By cone-contouring the stretching ring, the 
boundary is sharply defined and the bending pressure is 
concentrated at a definite point or sharp edge instead of 
over a rounded surface. The final stressing of the diaphragm 
raises the active portion somewhat and results in an almost 
flush surface. 


The clamping ring is machined at a slight angle, giving 
a spring washer locking action to the screws and assuring 
a positive clamping action on the diaphragm at the inner 
edge of the ring. The screws are driven home with a motor- 
driven screwdriver set for maximum safe and uniform 
tightening pressure. 


One of the first problems that developed in production 
was traced with some difficulty to the fact that small dif- 
ferences in diaphragm thickness resulted in variations of 
mechanical resonance responsiveness. By using the same 
electrode button to insure no variation in spacing, outside 
ring size, and electrode button size, it was found possible 
to obtain frequency-response characteristics from various 
diaphragms and casting assemblies that varied from 3 db of 
underdamping to 4 db of overdamping, with a median group 
of characteristics of 0.5 db over the desired range. This 
variation was traced to differences in thickness of diaphragm 
material due to nonuniformity in rolling of the stock. Care- 
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ful heat treating assured uniformity of grain structure and 
tensile characteristics. It was found possible to readily 
segregate the different thicknesses by weighing the blank 
diaphragms before assembly. By grouping these blanks 
into classes differing by approximately 20 mg and then ad- 
justing the air-gap spacing in the microphones by minute 
amounts for the individual groups, relatively little difficulty 
was found in producing uniform microphones. 

A special technique had to be developed in order to hold 
the air-gap space to the required +5-microinch maximum 
variation from the mean value of 0.000985 in. This method 
utilizes the viscous resistance to the flow of air through the 
gap formed by the electrode button and the face of a very 
carefully machined and lapped testing cap which is 
pressed over the electrode ring assembly while it is in the 
lathe undergoing the final machining operation. Air is 
drawn through a small hole bored in the tester cap centered 
directly above the electrode button. The airflow is estab- 
lished with a vacuum pump and set to a standard value with 
a regulating valve. The pressure drop across the air gap 
is measured with a mercury manometer. The sensitivity 
is such that 1 mm of mercury is equivalent to about 10 
microinches and can easily be read to +1 microinch. Al- 
though even greater sensitivity can be achieved by using a 
manometer fluid of lower density, this degree of reading 
accuracy was found to be adequate. 

The final cutting operation is performed by lapping the 
rim of a stretching ring with ground-glass plate. If, dur- 
ing assembly, a particular diaphragm is found to be out 
of its classification group, the assembly is returned to the 
machining operation with the direction and magnitude of 
improper damping noted, and it can then be readily ad- 
justed to match its particular characteristic with a mini- 
mum of lost time. 


2 STAGE 
LIMITER AND 
DISCRIMINATOR 
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TUNING 

METER 


AF AMPLIFIER 


SIMPLIFIED DIAGRAM OF THE 
ELECTROSTATIC MICROPHONE TESTER 


Fig. 4. Simplified diagram of the electrostatic microphone tester 
used in production testing the special flat-response D-24 micro- 
phones. 
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It was found possible to make small adjustments in air- 
gap spacing during assembly by taking advantage of the 
fact that the stretching ring causes a localized uniform 
circular deformation in the diaphragm at the contact point. 
Thus, by stretching the diaphragm to some point beyond 
the normal resonance point and then backing off, the center 
of the diaphragm tends to bow in, reducing the air gap by 
an extremely small amount, but in most cases enough to 
establish proper damping. Taking advantage of this phe- 
nomenon has saved hours of disassembly and regrinding and 
has reduced somewhat the final machining accuracy re- 
quirement, resulting in considerable time saving. 

The need for a simple, easily operated, dependable meth- 
od of determining the acoustic response characteristics led 
to the development of the “electrostatic microphone tester.” 
This instrument was based on the principles outlined by Dr. 
L. J. Sivian.* 

The electrostatic method of testing microphones repre- 
sents a departure from the usual acoustic methods of study- 
ing microphone response characteristics. Instead of driv- 
ing the diaphragm by the external application of alternating 
acoustic pressures, it is driven by electrical forces coming 
from an alternating electrical field applied across the air 
gap of the condenser microphone itself. 

A simplified circuit diaphragm is shown in Fig. 4. It 
shows the microphone being used as a capacitive element 
of a tuned rf oscillator. A dc polarizing voltage is applied 
to the insulated electrode button to avoid harmonic genera- 
tion. The second harmonic driving force, which results from 
the fact that the driving force is proportional to the square 
of the field, is minimized by maintaining the polarizing 
voltage substantially larger than the peak value of the ac 
driving voltage according to the formula: 


Peak ac voltage 1 
Second harmonic force = . 


2 polarizing voltage 


An af voltage is applied to the microphone case from a 
low-pass, low-impedance, variable-frequency generator, caus- 
ing the diaphragm to move at the af rate by electrostatic 
attraction. The resultant capacitance variation changes 
the rf oscillator frequency. The resultant frequency-modu- 
lated signals are multiplied in the plate circuit to increase 
the effective deviation, then fed through a two-stage limiter, 
discriminator, and negative feedback stabilized audio ampli- 
fier. 

A sample voltage is fed from the audio generator to the 
horizontal amplifier of a high-quality oscilloscope with very 
uniform phase characteristics. The output of the tester is 
fed through the vertical amplifier and checked for ampli- 
tude and phase characteristics. When the D-42 microphone 


3L. J. Sivian, Absolute Calibration of Condenser Transmitters, 
Bell System Tech. J., 10, 96 (January, 1931). 


is critically damped, the resonance point of the diaphragm 
can be detected only by the fact that the diaphragm motion 
is 90° out of phase with the applied voltage. In this way 
it becomes possible to measure and control the character- 
istics of the microphones under production conditions. The 
characteristic response under electrostatic deflection is es- 
sentially identical to that which will occur if acoustic excita- 
tion is used. 

When the electrostatic method of microphone sensitivity 
testing is used, conversion to acoustic sensitivity can be made 
only if the geometry is accurately known. Under uniform 
acoustic pressure excitation the surface of the diaphragm 
is a zero-order Bessel function revolved about the micro- 
phone axis, whereas under electrostatic excitation the dia- 
phragm cross section is a logarithmic contour extending over 
the area of the back electrode. In addition to this differ- 
ence in diaphragm contour, we have to remember that the 
driving force at constant driving voltage is inversely pro- 
portional to the square of the gap spacing d. If we define 
the pressure sensitivity of the microphone by dC /dp (change 
in capacitance per unit pressure) and the electrostatic sensi- 
tivity by dC/dE,, (change in capacitance per unit peak 
driving voltage), then it can be shown that 


d » { b 
== (2 )] 


dC 
dE,, 


where 


( b ) b* flog. (a/b) +m 

J a a J, (24b/a) 

a being the active radius of the diaphragm and 3 the radius of 
the electrode. With the aid of these relations and a calibra- 
tion of the output of the oscillator circuit, the absolute 
sensitivity dC/dp can be computed from input, output, and 
polarizing voltage measurements. 

The major source of phase shift in the microphone tester 
is in the discriminator circuit and is equal to approximately 
12° at 10,000 cps. For production purposes this can be 
ignored. An exact determination of resonance can be made 
by operating the microphone in a vacuum so that the me- 
chanical resonance becomes very sharply defined by a re- 
sponse peak. The apparent difference in phase can be 


ascribed to the tester. 

Amplitude-response curves of D-42 audio microphones 
can be obtained either by the use of an output level meter 
or by comparing the input and output level by means of a 
calibrated attenuator as the frequency is varied. 

The electrostatic microphone tester is used to check: (1) 
amplitude response of the condenser microphone, (2) phase 
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response of the condenser microphone, and (3) sensitivity 
measurement of the condenser microphone. 

The choice of beryllium-copper for diaphragm material 
was based on its high yield point, availability in thin gauges, 
and freedom from corrosion. Aluminum alloys were tried 
but were rejected because of the severe requirements of 
the equipment. Stainless steel probably could have been 
used, but it was unavailable in the required thinness, uni- 
formity, and width. The beryllium-copper diaphragm is 
1.6 mils thick and is rolled 6 numbers hard, but without 
heat treatment. It is then cut into disks 2.5 in. in diameter 
and perforated with sixteen holes equally spaced on a 144- 
in. circle concentric with the disk. These are the holes 
for the screws which eventually hold down the clamp of 
the microphone frame. 

To minimize changes in response characteristics of sensi- 


tivity, the microphone body is made from a special bronze 
alloy having nearly the same thermal expansion coefficient 
as the beryllium-copper diaphragm. By this manner it is 
possible to make a microphone whose resonance frequency 
changes only 1% for each 10°C change in temperature. 
Final stabilization is accomplished by temperature-cycling 
all microphones from —40° to +75°C four times. After 
cycling the microphones are checked for uniformity char- 
acteristics. If microphones after cycling do not check with- 
in a few per cent of the original characteristics, they are dis- 
carded. 

The successful original development of the microphone 
described was to a large extent due to the direction and 
effort of J. W. M. DuMond, of the California Institute of 
Technology, and W. K. H. Panofsky, now at Stanford 
University. 
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A Low-Noise FM Recording System 


Watter T. Seistep* 
Ampex Electric Corporation, Redwood City, California 


This paper describes the use of FM carrier techniques in magnetic tape recording for the purpose 
of improving the overall signal-to-noise ratio of a magnetic recording system. 

The system to be described has been demonstrated to have appreciably better than 70-db 
signal-to-noise ratio relative to a distortion level of 1%. 


E PRESENT technique of magnetic recording has 

several limitations as far as signal-to-noise ratio is con- 
cerned. The more important ones are: (1) amplifier noise 
such as hum, hiss, and microphonics; (2) modulation noise 
due to tape grain structure; (3) modulation noise caused by 
hf flutter; and (4) hf tape noise present even in the absence 
of modulation. The first limitation, that of amplifier noise, 
is due primarily to the fact that the output of the playback 
head is a function of the rate of change of flux rather than 
the flux density recorded on the tape. The output from 
such a playback head at low frequencies is therefore very 
low, and hum and other noise problems in the amplifier 
become of great importance. At high frequencies, the output 
from the tape also is reduced to a very low value because of 
the extremely short wavelengths which are being recorded 
on the tape. As a result, tube hiss and microphonics effects 
can also be the limitation as far as signal-to-noise ratio is 
concerned. The second factor, that of modulation noise 
due to tape grain structure, is probably the least important 
of the limitations and need not be covered in any detail. 
The third limitation, modulation noise due to tape flutter, 
is probably the most important of those mentioned here. 
This type of noise behind the signal is due primarily to the 
scraping of the tape across guides and heads and is a func- 
tion of tape surface irregularities and lubrication. Since 
even the best of tape transport mechanisms may introduce 
as much as 1% flutter in the region of 1,000-5,000 cps, 
it is easy to understand how modulation noise due to this 
type of tape speed irregularity may easily be only 35-40 db 
below the signal level. Modulation noise of this type is most 
commonly noticed during the reproduction of If notes such 
as those produced by an organ. The fourth limitation, that 
of hf tape noise, is particularly important when slow tape 
speeds are used for wide-range recording. Under such con- 
ditions the output from the tape at very high frequencies is 
extremely low. As a result, post-emphasis must be included 
in the amplifier, which results in amplification of the white 
tape noise components that exist as a result of nonhomo- 
geneous tape coatings and random magnetization thereof. 
In order to overcome these limitations and various other 
minor ones, the use of a frequency-modulated tape recording 
system was considered. 


* Chief Engineer. 


To verify the advantages of this recording method, an 
experimental system was constructed. The FM recording 
system tested for use as an audio recorder is shown in block 
diagram form in Fig. 1. This recording system consists of 
three units, as do most recording systems made. These are 
the record amplifier, the tape transport, and the playback 
amplifier. In the FM system the record amplifier consists 
of a frequency-modulated multivibrator coupled to the record 
head through a buffer amplifier. The frequency-modulated 
multivibrator is modulated by inserting the audio signal in 
the negative return of the grid resistors. This type of modu- 
lator oscillator can be modulated plus or minus 60% in 
frequency, with a linearity far better than 1%. For use as 
an audio recorder, the center frequency of the modulated 
oscillator is chosen at approximately 50 kc. This means 
that with a plus or minus at 60% frequency shift, which 
is considered 100% modulation level, the negative frequency 
swing of the oscillator will go to 20 kc and the upward fre- 
quency shift will reach as high as 80 kc. The buffer amplifier 
delivers sufficient power to the record head so that tape 
saturation is assured. This eliminates need for an erase 
oscillator, since previously recorded signals will be wiped 


out in the process of re-recording the tape. The signal-to- 


noise ratio of the record amplifier portion of the recorder 
can very easily be held to better than 80 db below 100% 
modulation. 

The next unit to be discussed, as seen in Fig. 1, is that 
of the tape transport. In the ordinary type of magnetic 
recorder, tape speed flutter shows up as modulation noise. 
In the FM magnetic recording system, tape flutter shows up 
as background noise, or as a limit to the signal-to-noise ratio 
possible with this system. In order to achieve a signal-to- 
noise ratio of approximately 70 db, it is necessary that the 
tape transport system have extremely low flutter. Since 
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Fic. 1. Block diagram of the FM recording system. 
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the deviation of the modulated carrier is 60% peak, the 
rms deviation is 0.707 times 60%, or approximately 42%. 
In order that we achieve only a 40-db signal-to-noise ratio, 
the flutter introduced by the tape transport must not exceed 
0.42%. For a 60-db signal-to-noise ratio, the flutter would 
have to be as low as 0.042%. For an 80-db signal-to-noise 
ratio, the flutter would have to be 0.0042%. Since the 
flutter measured throughout the audio spectrum introduced 
by the transport used in this test was 0.012%, the flutter 
produced a background noise level which was 71 db below 
100% modulation. The conventional type of tape transport 
mechanism does not have sufficiently low flutter to accom- 
plish such a signal-to-noise ratio with the FM system. The 
tape transport is that which is used in the standard Ampex 
Model 500 tape recorder designed primarily for telemetering 
recording. The drive mechanism shown in Fig. 2 illustrates 
the basic principles involved in this type of tape transport. 
You will note that the tape is wrapped around the capstan 
with the oxide surface outward and held firmly to the surface 
of the capstan by a vacuum system. While the tape is in 
intimate contact with the capstan, the record and playback 
heads are held against the oxide surface by spring pressure. 
Since the unsupported tape length in this type of tape trans- 
port is extremely short, the components of flutter above 500 
cps are negligible. The flutter components below 500 cps 
are due primarily to mechanical design and can be. held 
rather conveniently to extremely low values. Since the 
capstan rotation rate is approximately 6 cps, the components 
of background noise introduced by it will not be heard as 
they are below the audible spectrum. Components of flutter 
introduced by the vacuum valving system and the porting 
on the capstan surface contribute most of the 0.012% 
flutter which limits the signal-to-noise ratio to 71 db. It is 
conceivable that with great care the components introduced 
by the valving and porting might be still further reduced. 
Figure 3 presents a cutaway view of the capstan and 
head assembly, showing how the tape contact to the head 
is accomplished. It will be noted that the capstan is 
equipped with grooves in its surface located such that the 


Fie. 2. Principle of the tape transport system. 


Fig. 3. Cutaway view of the capstan and head assembly. 


heads contact the tape opposite these grooves. This pro- 
vides a compliant surface on the tape at the point where the 
heads contact it. It would not be possible to get proper 
head-to-tape contact if the capstan were supporting the 
tape at the point of contact. The great motional stability 
of this drive is what makes it possible to use an FM system 
for the recording of audio signals. 

The third unit of the FM recording system is the play- 
back amplifier. This unit is just as unlike the conventional 
amplifier for tape recorders as is the record amplifier for the 
FM system. Since the playback head is reading a saturated 
tape, its output is quite appreciable. In spite of this, the 
carrier, as reproduced from the tape, is alternately ampli- 
fied and clipped by four successive limiting stages. The 
output of these limiters is a squared wave essentially iden- 
tical to the square wave produced by the multivibrator in 
the record amplifier. This signal is then differentiated, 
rectified, and integrated in the demodulator circuit. A very 
simple low-pass filter then reduces the level of the carrier 
remaining in the output such that the intelligence being 
recovered is higher in level than the carrier remaining. The 
signal-to-noise ratio of the playback amplifier can be held to 
better than 85 db. When the record amplifier is connected 
directly to the playback amplifier, bypassing the tape trans- 
port mechanism and the heads, the signal-to-noise ratio of 
these two units alone is better than 80 db. 

An FM recording system of the type described has a 
number of advantages over the standard recording systems 
now used. Since the signal level remains high throughout 
the entire recording system, there is essentially no trouble 
from tube noise, hum, hiss, or microphonics. The frequency 
response of an FM recording system can be drawn with a 
straightedge. Since the only frequency-determining element 
is a low-pass filter in the output of the playback amplifier, 
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it is possible to establish a frequency characteristic which 
is within 1/10th of a decibel from 10 to 20,000 cps. Such a 
frequency-response characteristic is, of course, independent 
of heads, time, tape coating, and wear, because these factors 
do not enter into the performance of the system. This 
feature would make this system desirable for permanent 
records such as original masters for recordings. The distor- 
tion characteristics of the FM system are due almost en- 
tirely to the electronics portion of the equipment. The 
distortion introduced at 100% modulation can readily be 
held to 1%, or lower. A small amount of distortion is in- 
troduced in the actual recording process as a result of phase 
shifts which are introduced between the tape and the heads. 
This distortion, however, is very small compared to the 
distortion introduced by the electronics. Since the distortion 
is primarily introduced by the electronics, it is much easier 
to control the distortion than it is in the case of standard 
tape recorders where the distortion is a function of bias, 
record level, and other tape variables. Modulation noise 
which is present in direct recording is completely absent in 
the FM system because the tape flutter which causes modu- 
lation noise appears only as background noise in the FM 
system. Thus far we have discussed the FM syste’n only 
on the basis of a flat, overall frequency-response character- 
istic. On this basis the signal-to-noise ratio was approxi- 


mately 71 db below 100% modulation and the background 
consisted primarily of flutter components below 500 cps. 


It would, therefore, be possible to increase appreciably the 
signal-to-noise ratio by pre-emphasizing to some degree in 
the record system and de-emphasizing in playback. If de- 
sired, it would be possible to improve the signal-to-noise 
ratio in the lower portion of the audio spectrum by approxi- 
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mately 3-4 db. More pre-emphasis than this is undesirable 
in this portion of the audio spectrum because of the energy 
distribution found in normal music. Pre-emphasis amount- 
ing to approximately 10-15 db at 15,000 cps could be used 
to some advantage in further reducing the background noise 
in the upper portion of the audio spectrum. If such pre- 
emphasis is considered as being part of the system, the 
signal-to-noise ratio is well above 70 db on an unweighted 
basis, and on a weighted basis it approaches 80 db rather 
rapidly. 

Even though the FM recording system shows numerous 
advantages over the present types used, it has, of course, 
certain disadvantages. One of these is its tape extravagance. 
It requires a tape speed of approximately 60 ips in order to 
get an af response up to 20,000 cps. For certain applications 
this is completely prohibitive, and the advantages do not 
outweigh this use of tape. A second disadvantage is that 
the modulation system does not overload gracefully. Al- 
though the distortion may not be greater than 1% at 60% 
deviation, the distortion may go as high as 5% at 65% 
deviation, and as high as 20% at 75% deviation. This 
characteristic requires the use of a limiting amplifier prior 


‘to the record amplifier in order that modulation levels ex- 


ceeding that value which is considered 100% do not occur. 
Since the standard type of tape recorder is its own limiting 
amplifier, the only disadvantage to this particular charac- 
teristic of the FM system is that a limiting amplifier must 
be purchased. In spite of these disadvantages, it is felt 
that there is very definitely a place for this type of system 
in the audio recording field, particularly in those places where 
excellence of recording is of primary importance and cost is 
secondary. 
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Aircraft Public Address System Operation under High-Level Ambient Noise Conditions 


Jackson C. BAKER* 
Lockheed Aircraft Corporaiion, Burbank, California 


This paper outlines the reasons for and the requirements of a good public address system in 


airplanes. A résumé of the pertinent characteristics of speech and hearing is given. 


The sources 


and character of the noises in flight and the effects of these noises on speech communication are 
discussed. The possibilities of noise reduction and the developments to date are outlined. The 
qualification of each of the system components necessary for satisfactory operation in high-level 
noise fields is given, and the need for further tests and development work is pointed out. 


INTRODUCTION 


| emerager, like seeing and the other senses of man, is a 

mode of consciousness. Measurements of quantities 
involving hearing responses are therefore dependent upon 
human opinions about mental impressions. This fact ac- 
counts for the considerable differences between published 
results of similar experiments and tests in the field of hear- 
ing. 

A public address system in a commercial transport air- 
plane is necessary for several reasons. It enables the pilot 
to give the passengers information about the trip, their 
destination, the points of interest along the way, and, in an 
emergency, instructions of what to do for their own protec- 
tion and safety. In military aircraft, there is the added 
requirement of the transmission of orders from the com- 
manding officer to other personnel on board. In the latter 
case, it is imperative that the information be heard by the 
listeners clearly and precisely. This criterion is just as 
important where the listeners wear earphones, as with 
crew intercommunication systems, for example, as it is 
with loudspeaker operation in troop transports. 

All the above requirements can be met only by an instal- 
lation which achieves: (1) a satisfactorily high signal-to- 
noise ratio all the way from the speaker’s lips to the listen- 
er’s ears; (2) low distortion in all system components; (3) 
sufficient power-handling capacity in all components to meet 
the above two requirements; and (4) a frequency-response 
range adjusted to provide the highest possible degree of 
intelligibility. 

In this paper sound intensity levels are expressed in 
decibels with reference to 10° w/cm*. Sound pressure levels 
are in decibels with reference to 0.0002 dyne/cm?. 


THE HEARING MECHANISM 


Since the sole function of a public address system is to 
deliver information from a speaker to a listener, the design 
of a system must start with consideration of the character- 
istics of the hearing and speech mechanisms. 

The human ear is an energy transducer in which sound 


* Research Engineer. 
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Fic. 1. Various important threshold sound levels for the human ear. 


waves—as air pressure variations—are transmitted through 
a bone leverage system to the liquid of the inner ear, which 
causes movements of the basilar membrane. The auditory 
nerve endings are distributed along this membrane and re- 
spond selectively to different frequencies, depending on their 
location along the membrane. This enables the ear to act 
as an acoustic analyzer. 

The ear has a nonlinear response to both sound pressure 
variations and frequency. Figure 11 shows some important 
characteristics of hearing. The curve labeled “binaural 
threshold” shows the minimum values of sound pressure, in 
decibels vs single frequencies within the audible range, that 
can be heard by normal ears under conditions of perfect 
quiet such as in an anechoic chamber. As the sound pressure 
levels are increased, the shape of the curve changes, as can 
be seen in the familiar Fletcher-Munson and the Churcher- 


- King equal loudness contour charts. At 100 db, the curve 


is relatively flat. 
As the sound pressure is increased to about 120 db, the 


1H. O. Parrack, Physiological and Psychological Effects of Noise, 
Proc. 2nd Annual Noise Abatement Symp., 2, 25 (1951). 

2 These two sets of equal loudness contours show considerable 
differences at the lf and hf ends. A comparison of the two is shown 
in an article by Beranek, Marshall, Cudworth, and Peterson, Calcula- 
tion and Measurement of the Loudness of Sounds, J. Acoust. Soc. 
Amer., 23, 262 (1951). 
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sensation becomes unpleasant. This is called the discomfort 
threshold. Further increase up to the 140-db level brings 
about a definitely painful feeling. This threshold of pain 
level has been confirmed by several experimenters, including 
the United States Air Force Aero Medical Laboratories, at 
Dayton, Ohio. 

The mechanical damage threshold shown at 160 db is 
based on experience at the Aero Medical Laboratories and 
two reported cases of eardrum damage. 

Referring to the lower left-hand corner of Fig. 1, there 
are two speech thresholds indicated. The threshold of de- 
tectability is the level at which the sound is just recog- 
nizable as speech but no specific words can be identified. 
The threshold of intelligibility is the sound pressure level 
at which nearly every word can be understood and the mean- 
ing of the message is clear. This is the most important 
threshold because it is obvious that speech signals must be 
maintained above this threshold, at the ear of the listener, 
under all conditions, for satisfactory speech communication. 


SPEECH CHARACTERISTICS 


The enclosed area on the chart labeled “conversational 
speech” shows the pressure level range and frequency range 
of the sounds of normal conversational speech. The fre- 
quency range is about 100—8,000 cps. However, the essen- 
tial “heart” of this range lies between 200 and 6,000 cps. 
The overall pressure level of the speech signal varies con- 
siderably, and so does the level of each of the many fre- 
quency components. The pressure levels shown were meas- 
ured 18 in. from the speaker’s lips. At greater distances, 
the pressure levels would be lower. Increasing the speaking 
effort or moving closer would, of course, raise the pressure 
levels. There is, however, a limit to the level of the sound 
above which the ability to understand speech decreases 
rather than increases. 

This phenomenon acquires particular significance when 
communication is attempted in the presence of high levels 
of noise. In an area of noise in the 100—-120-db range, no 
increase in intelligibility is achieved by raising the speech 
signal above a level of 100-105 db. Above this level, intel- 
ligibility decreases. When there is very little noise present, 
the maximum understandability is reached when the levels 
of the speech signal range between 40 and 90 db.* 

Considerable research has been carried on to determine 
the speech spectrum—the variation in speech signal in- 
tensity as a function of frequency.* As would be expected, 


% These effects are illustrated in a chart of per cent word articulation 
vs speech level in Transmission and Reception of Sounds under 
Combat Conditions, Summary Technical Report of the National De- 
fense Research Committee, Vol. 3, p. 76, 1946. 

4N.R. French and J. C. Steinberg, Factors Governing the Intel- 
ligibility of Speech Sounds, J. Acoust. Soc. Amer., 19, 93-94 (1947). 

5 See ref. 3, pp. 58-68. 
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Fic. 2. Idealized, long average speech spectrum at one meter 
from lips in a sound field free from reflections. 
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there is considerable variation in the speech curves for dif- 
ferent people. However, for the purposes of planning speech 
communication systems, an arbitrary compromise can be 
used conveniently. Figure 2* shows this idealized, long aver- 
age speech spectrum at one meter from the lips in a sound 
field free from reflections. 

As can be seen, the maximum speech power is radiated 
in the 300—500-cps region; and above about 700 cps, the 
power falls off at a rate of approximately 10 db per octave. 


AIRCRAFT NOISE SOURCES 


With this brief review of some of the characteristics of 
speech and hearing, let us turn our attention to the environ- 
ment in which an aircraft public address system must oper- 
ate. The most difficult problem in the design of such a 
system is the presence of high levels of noise at both the 
input and output ends. 

The overall noise in flight comes from many sources. 
Figure 3° shows the directional characteristics of three dif- 


6H. H. Hubbard, A Survey of the Aircraft Noise Problem with 
Special Reference to Its Physical Aspects, NACA Tech. Note 2701, 
p. 28 (1952). 
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ferent types of noise sources. Under today’s operating con- 
ditions, the propeller generates the largest part of the noise. 
As can be seen, the peak intensity of the sound at supersonic 
tip speeds is slightly to the rear of the plane of the propeller. 
The fundamental frequency is the rpm number of blades/ 
60 (in cycles per second). For supersonic tip speeds the 
harmonics are higher in intensity than the fundamental; at 
subsonic speeds the fundamental predominates. Turbulence 
and vortices are produced by the propeller blade as it passes 
through the air. These disturbances produce a broad-band 
noise of relatively higher frequencies and lower intensities 
than the propeller tips. 

The reciprocating engine exhaust noise as indicated in 
Fig. 3 is radiated almost uniformly in all directions. The 
character and intensity of this noise depends on the exhaust 
stack configuration and is higher for jet stacks than for the 
collector ring type. At low engine speeds, the exhaust noise 
level could be higher than that of the propeller; but at the 
higher operational engine speeds, the propeller noise is pre- 
dominant. 

The noise from the turbojet engine is concentrated in a 
rearward direction at an angle varying with jet velocity 
and the configuration of the exit opening. This is a very 
broad-band noise generated by jet mixing and turbulence 
with the highest intensity in the 1,000-cps region. The jet 
noise is proportional to slightly more than the cube of the 
exit velocity. 

Considerable aerodynamic noise can be generated at high 
airspeeds. This noise is developed near the skin of the air- 
plane and is due to turbulence. It is proportional to a little 
more than the square of the airspeed so that as the speed 
of airplanes increases this type of noise may become quite 
large. Being due to turbulence, it is a broad-band noise in 
the higher frequency ranges. 

Other noise sources consist of structural vibration, equip- 
ment rattles, etc., and are usually not important from a 
conversation standpoint but may be annoying to the pas- 
senger. 

The presence of noise in speech communication has a 
masking effect which tends to reduce the intelligibility. A 
sample of this masking effect is shown in Fig. 4,7 where the 
masking sound is a single frequency tone of 400 cps at 70-db 
level. The curve shows how much the audibility threshold 
is raised for a wide range of frequencies in the presence of 
the 400-cps tone. There seems to be a greater masking 
effect on frequencies above the masking frequency than 
below. This same shape and relative location of the curve 
applies to the masking effect of a narrow band of noise cen- 
tered at the same frequency. As the masking noise band 
broadens, the masked threshold curve broadens also. 


7 See ref. 6, p. 22. 
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Fig. 4. Inerease in audible threshold for a range of frequencies 
caused by a 400-eps tone of 70 db. 


NOISE REDUCTION 


The problem of maintaining a low level of noise in the | 
interiors of airplanes is growing increasingly more difficult. | 
The intensity of the noise generated at the source is so 
closely a function of the power being developed by the 
engine and absorbed by the propeller, or other propulsive 
medium, that very little can be done to eliminate the noise 
at the source. 

Various attempts at quieting have been made by putting 
mufflers on the engine exhaust and reducing propeller tip 
speeds by slower and larger propellers. In all cases there 
has been either a loss in power or an increase in weight or, 
usually, both. The amount of noise reduction achieved was 
small—of the order of 2-4 db-—an amount measurable by 
meter but not noticeable by passengers.* 

The trend of aircraft speeds is ever higher. The power 
required to fly an airplane varies as the cube of the speed. 
Thus, doubling the speed would require at least eight times 
as much power. The problem of noise masking in the speech 
communication system is already serious and evidently will 
become more so. 

For a long time, research has been carried on to achieve | 
more noise reduction between the outside and inside. The 
heart of the problem is the weight penalty imposed by sound- 
absorbing materials. The decision between payload and pas- 
senger comfort is a difficult one. Figure 5° shows the noise 
reduction obtainable in an airplane cabin by conventional 
soundproofing techniques.’” 

Insulation against If noises is difficult, as can be seen from 
the curve, except through the use of heavy materials for 


8 J. J. Haggerty, Plane Noise Muffled, Except for Human Ears, 
Am. Aviation, Feb. 4, 1952, p. 26. 

% See ref. 6, p. 40. 

10 R. H. Nichols, H. P. Sleeper, R. L. Wallace, and H. L. Ericson, 
Acoustical Materials and Acoustical Treatments for Aircraft, J. 
Acoust. Soc. Amer., 19, 428-443 (1947). 
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Fig. 5. Noise reduction obtainable for various frequency bands 

in an airplane fuselage by conventional soundproofing techniques. 
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Fig. 6. Noise spectra of a typical four-engine transport in the 
passenger cabin—with and without insulation—and Military Speci- 
fication No, SR-181, sound levels for combat aircraft. 


which no lightweight substitute has yet been discovered. In 
an airplane, where every pound is important, this extra 
weight of insulation is undesirable from the operator’s view- 
point. Fortunately, the If noises are much less annoying 
to passengers than are the hf ones, where soundproofing can 
be effective. Also, as can be seen in Fig. 2, the maximum 
speech sound power is radiated in the region of 400 cps, 
which is beneficial. From the standpoint of speech intel- 
ligibility, the 3 octaves between 600 and 4,800 cps are the 
most important, and fortunately they lie in the range of 
fairly effective noise reduction. 

Figure 6"! presents the noise spectra of a typical four- 
engine transport in the passenger cabin before and after 
application of insulation. This shows the effectiveness of 
the insulation practice as indicated in Fig. 5. The military 
specification curve for sound levels in combat aircraft is 
included to show the requirements that must be met. 


SL. Ericson, The Problems of Noise Reduction from an Air- 
plane Design Standpoint, S.A.E. Preprint 97, p. 15 (1947). 


JACKSON C. BAKER 


SYSTEM COMPONENTS 


To be suitable for use in aircraft public address systems, 
microphones should have the following characteristics: (1) 
a frequency range relatively flat from 200 to 6,000 cps and 
free of any peaks; (2) linear output vs input characteristics 
to avoid amplitude distortion; (3) nonmicrophonic to air- 
borne pressure waves; (4) unidirectional so as to exclude all 
unwanted sounds and noise; and (5) ability to operate at 
high input levels. 

The frequency range should be restricted to that required 
for maximum intelligibility so that noise or unwanted sounds 
higher or lower in frequency are not introduced into the 
system. Since microphones will usually be placed near one 
of the loudspeakers either in the cockpit or at the hostess 
station, the response characteristic should be free of peaks 
so as to minimize feedback tendencies. 

The signal-to-noise ratio is the most important character- 
istic for operation in high noise level fields. The higher this 
ratio is for the microphone, the less speaking effort is re- 
quired to override the masking effect of the ambient noise 
for satisfactory intelligibility. 

Carbon-type microphones have been almost universally 
used up to the present because of their ruggedness and high 
sensitivity, which require less amplifier gain. They have, 
however, several deficiencies. They are essentially non- 
linear devices, with frequency response dependent on sound 
input level. Their overload point is too low, and they can- 
not be used satisfactorily where ambient noise levels are 
higher than 90 db. 

A test was run in a Lockheed Constellation to compare 
the standard carbon-type handset with a good-quality con- 
denser microphone. When the carbon handset was used 
near the cockpit loudspeaker, feedback oscillation developed. 
The condenser microphone was placed at the same location, 
and no feedback occurred. Owing to the lower sensitivity of 
the condenser microphone, additional amplification was re- 
quired; but the sound heard over the loudspeakers was clearer 
with a much lower noise background. This enabled it to be 
used at locations where the noise level plus speech input 
overloaded the carbon microphone, causing consequent 
severe distortion in the sound output. 

The amplifier presents no unusual problems. Of course 
it must be built to withstand the shock and vibration in- 
herent in aircraft operation. The frequency response should 
be limited to the range needed—about 200-6,000-cps—so 
that frequencies above and below this range do not appear 
in the output as needless output noise. The response curve 
should be essentially flat in the required range with no peaks 
(to minimize feedback tendencies). The output power must 
be sufficient to drive all the loudspeakers to a sound level 
at least 15 db above ambient noise, with low distortion. 

Loudspeakers, the final link in the public address system, 
must have a power-handling capacity sufficient to deliver 
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sound at a level at least 10 db above ambient noise, in the 
speech range, with low distortion. This level would need 
to be about 130 db at the average listener distance. This 
high a level of output requirement for the individual speakers 
makes a multiple speaker installation the only practical 
solution. 

The 200-cps If cutoff makes it possible to use small speak- 
ers in the 5-in. and 6-in. size with satisfactory results. They 
must, however, be of better quality than table radio types to 
withstand the high operating levels that might be required. 

Special attention must be paid to the frequency distribu- 
tion angle in connection with the spacing and placement of 
the speakers. Since with cone speakers the sound beam 
angle becomes smaller as the frequency increases, and since 
the major part of intelligibility is in the 3 octaves of 600— 
4,800 cps, care must be taken in planning the layout of the 
speakers to match their frequency distribution angle. 

The frequency response curve should be relatively flat 
throughout the range of 200—6,000 cps and free of peaks. A 
high peak in the response curve of any of the system com- 


ponents will cause feedback oscillation to start at a lower 
overall sound output level than it would without peaks. 
This, of course, would lower the allowable operating level 
of the system as a whole below that level of which it would 
be otherwise capable. 


CONCLUSION 


The material in this paper has been presented for the 
purpose of outlining the problems involved in achieving a 
high degree of intelligibility in a speech communication 
system operating in the high-level ambient noise conditions 
that exist in our present-day airplanes. 

Much work still needs to be done in the field of reducing 
the noise levels in airplane interiors. This would increase 
passenger comfort as well as the ease of speech communica- 
tion. 

More testing and development work is necessary to de- 
termine the best type of microphone for use in these severe 
noise conditions. The same situation applies to loud- 
speakers. 
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